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Abstract: To asses stability against 1// noise, the Low Frequency Instrument (LFI) on-board the 
Planck mission will acquire data at a rate much higher than the data rate allowed by the science 
telemetry bandwith of 35.5 kbps. The data are processed by an on-board pipeline, followed on- 
ground by a decoding and reconstruction step, to reduce the volume of data to a level compatible 
with the bandwidth while minimizing the loss of information. This paper illustrates the on-board 
processing of the scientific data used by Planck/LFI to fit the allowed data-rate, an intrinsecally 
lossy process which distorts the signal in a manner which depends on a set of five free parameters 
(^aver> ^i, r2, q, O) for each of the 44 LFI detectors. The paper quantifies the level of distortion 
introduced by the on-board processing as a function of these parameters. It describes the method of 
tuning the on-board processing chain to cope with the limited bandwidth while keeping to a mini- 
mum the signal distortion. Tuning is sensitive to the statistics of the signal and has to be constantly 
adapted during flight. The tuning procedure is based on a optimization algorithm applied to unpro- 
cessed and uncompressed raw data provided either by simulations, pre-launch tests or data taken 
from LFI operating in a special diagnostic acquisition mode. All the needed optimization steps 
are performed by an automated tool, 0CA2, which simulates the on-board processing, explores the 
space of possible combinations of parameters, and produces a set of statistical indicators, among 
them: the compression rate and the processing noise eg. For Planck/LFI it is required that 
Cr = 2.4 while, as for other systematics, eg would have to by less than 10% of rms of the instru- 
mental white noise. An analytical model is developed that is able to extract most of the relevant 
information on the processing errors and the compression rate as a function of the signal statistics 
and the processing parameters to be tuned. This model will be of interest for the instrument data 
analysis to asses the level of signal distortion introduced in the data by the on-board processing. 
This method was applied during ground tests when the instrument was operating in conditions rep- 
resentative of flight. Optimized parameters were obtained and inserted in the on-board processor 
and the performance has been verified against the requirements, with the result that the required 
data rate of 35.5 Kbps has been achieved while keeping the processing error at a level of 3.8% of 
the instrumental white noise and well below the target 10% level. 
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1. Introduction 

One of the most challenging aspects in the design of an astronomy mission in space is the ability to 
send the collected data to the ground for the relevant analysis within the allowable telemetry band- 
width. In fact the increasing capabilities of on-board instruments generates ever larger ammounts 
of data whereas the downlink capability is quite constant being mainly governed by the power 
of the on-board transmitter and the length of the time window which can be allocated for data 
down linking [Bertotti, Farinella, Vokrouhlicky (2003)]. In the case of the ESA satellite Planck, 
which will observe the CMB from the second Lagrangian point (L2) of the Earth - Sun system, 
1.5 X 10^ Km far from Earth, the down-link rate is limited to about 1.5 Mbps, and Planck can be in 
contact with the ground station (located at New Norcia, Western Australia) for no more than a cou- 
ple of hours each day thus reducing the effective bandwidth by an order of magnitude. In addition, 
Planck carries two scientific instruments: the Planck Low Frequency Instrument (Planck/LFI), to 
which this paper is devoted, and the Planck High Frequency Instrument (Planck/HFI). Both share 
the bandwidth to download data with other internal spacecraft services and the up-link channel The 
result is that LFI has only about 53.5 Kbps average down link rate while producing a unprocessed 
data rate of about 5.7 Mbps. It is evident that some kind of on-board data compression must be 
applied to fit in to the available telemetry bandwidth. 

It is well known that the theoretical maximum compression rate achievable for a given data 
stream decreases with its increasing variance. Thus it is very advantageous before appying any 
compression algorithm to preprocess the data to reduce its inherent variance. In the ideal case 
the preprocessing would not alter the original data, but in practice some information loss can not 
be avoided when the variance is reduced. Thus the on-board preprocessing algorithm should be 
tunable through some kind of free processing-parameters in order to asses at the same time the 
required compression rate at the cost of a minimal degradation of the data. This paper addresses 
the problem of the on-board processing and the corresponding ground processing of the scientific 
data and the impact on its quality for the Planck/LFI mission. This has also been the topic of 
two previous papers, the first regarding the exploration of possible lossless compression strategies 
[Maris et al. (2000)], and the second focused to the assessment of the distortions introduced by a 
simplified model of the on-board plus on-ground processing [Maris et al. (2004)]. Here the work 
presented by [Maris et al. (2004)] is completed by introducing in Sect. 2 a brief description of the 
instrument followed by a quantitative model of the on-board plus on-ground processing applied in 
Planck/LFI. The processing can be tuned with the statistical properties of the signal and introduce 
as small as possible distortion, to asses the proper compression rate and as small as possible 
processing distortion. This can be performed by using a set of control parameters, as anticipated in 
[Maris et al. (2004)], which are tuned on the real signal. The tuning algorithm, which has not been 
discussed previously, is the most important contribution to the Planck/LFI programme presented 
in this work and it is discussed in Sect. 3. The whole procedure has been validated both with 
simulations and during the pre-flight ground testing. The most signifcative results are reported in 
Sect. 4. Of course, processing has an impact on Planck/LFI science whose complete analysis is 
outside the scope of this paper but however is briefly analyzed in Sect. 5. At last Sect. 6 reports the 
final remarks and conclusions, while some technical details are presented in appendices A, B and 
C. 
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Figure 1. A schematic view of the main flow of scientific data for a single RCA of Planck/LFI. Each 
RCA has two detectors, but in this scheme only the first is represented and schematized. For graphical 
purposes the scheme represents just the first detector while connected to the reference-load, while Detector 
2 would be connected to sky. At a change of the Clock phase the two detectors will switch their connections. 
The block arrows represents the flow of digitized data and telemetry toward the spacecraft and the flow of 
telecommands from the spacecraft. 

2. Radiometer model and acquisition chain 

Planck/LFI [Bersanelli et al. (2009)] is based on an array of 22 radiometers assembled in 11 Ra- 
diometric Chain Assemblies (RCA) in the Planck focal plane. Each RCA has 4 radio frequency 
input lines and 4 radio frequency output lines, hence the number of radio frequency outputs to be 
measured by the on-board electronics is 44. Each feed-horn has one orthomode transducer with 
two outputs: each extracting the two orthogonal components of linear polarization in the signal 
received from the sky and feeding one of the radio frequency input lines of a radiometer, the other 
radio frequency input line is connected to a reference-load held at the constant temperature of 
4.5 K. 

A schematic representation of the flow of information in a single radiometer belonging to a 
RCA is given in Fig. 1 . Each radiometer acts as a pseudo-correlation receiver [Villa et al. (2009)] 
measuring the dilFerence in antenna temperatures, Ar, between the sky signal, Tsky, and the reference- 
load Tioad, [Valenziano at al.(2009)]. However, given the sky and the reference-load have dilFerent 
mean temperatures the reference samples have to be scaled by a Gain Modulation Factor, r, which 
balances the difference between Tsky and Tioad to a mean[Ar] = so that 

Ar = Tsky - rrioad- (2.1) 
A proper choice of r will allow near cancellation out most of the first order systematic errors 
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[Mennella et al. (2003), Mehinold at al.(2009)], assuring in this way optimal rejection of system- 
atics, in particular drifts and the 1// noise [Mennella et al. (2009)]. As a first approximation it is 
possible to put 

mean Tsky + ^noise 

r~ , (2.2) 

mean[rioad] + ^noise 

where Tnoise is the noise temperature. Eq. (2.2) makes evident how different values of r are needed 
in the various phases of the mission. In particular three cases are important: ground tests, in-flight 
cooling phase and finally in-flight operations with the instrument in nominal conditions. As an 
exemple consider the case of the 30 GHz channel, which is the least noisy channel of Planck/LFI 
having an expected Tnoise ~ 10 K. During on-ground testing mean^rsky] ~ mean[7'ioad] and so r ^ 1 
([Bersanefli et al. (2009), Mennella et al. (2009)]). In flight mean[7'sky] ~ 2.725 K but during the 
cooling mean[rioad] varies from ^20 K down to the nominal mean[7'ioad] ~ 4.5 K. Thus r varies 
from ^ 0.4 when the instrument starts to cool-down to ^ 0.88 at the end of the process when it 
reaches its nominal temperature. With higher values of Tnoise the other channels will show smaller 
departures in their r from 1 as well as a lower sensitivity to the environmental conditions. 

To acquire sky and reference-load signals each radiometer has two separate radio frequency 
inputs, and correspondingly two radio frequency outputs, each one connected to a radio frequency 
detector and to an acquisition chain ending in a 14 bit analog-to-digital converter (ADC) housed 
in the Digital Acquisition Electronics box (DAE) [Bersanelli et al. (2009), Villa et al. (2009)]. The 
output of the DAE is sent to the Radiometer Electronics Box Assembly box (REBA) ' which pro- 
cesses the data from the DAE, of interpreting and executing telecommands, and of interfacing the 
instrument with the spacecraft Central Data Management Unit. This unit produces the scientific 
packets to be sent to the ground [Herreros et al. (2009)]. 

The DAE applies a individually programmable analogue offset to each input signal prior to 
applying individual programmablt gains and performing digitization. The contribution to the read- 
out noise budget from the ADC quantization is in general considered marginal. Appendix B dis- 
cusses the case in which this hypothesis is no longer valid. The offset and the gain are adjustable 
parameters of the DAE and it is assumed that their calibration is independent from the REBA caU- 
bration [Cuttaia at al.(2009)] with an exception which is discussed in Appendix C. The ADCs are 
fetched in turn and the data are sent to the Science Processing Unit (SPU), a Digital Signal Pro- 
cessor (DSP) based computer which is part of the REBA [Herreros et al. (2009)] not represented 
in Fig. 1. The SPU stores the data in circular buffers for subsequent digital processing and and 
then appUes the on board software pipeline to the data. In the process the 14 bit single samples are 
convert to 16 bits signed integers. The content of each ADC buffer is processed separately by the 
on-board processing pipeline and sent to ground. 

As usual in these kinds of receivers, the required stability of the radiometers is assured by 
switching each radiometer between the sky and reference-load. Thus each output alternatively 
holds the sky and the reference-load signal (or the reference-load and the sky) with opposed phases 

'LFI has two redundant REBA units, but since they are perfectly equivalent in what regard the on-board data pro- 
cessing, in this paper we will consider LFI as having one REBA only. 
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between the two channels. Hence, each buffer contains strings of interlaced sky — ^reference-load 
(or reference-load — sky) samples in increasing order of acquisition time, t i.e. 



.ADC ^ADC ^ADC ^ADC 
sky,«=0' load,f=l ' sky,f=2' load,«=3' ' • ' ' 



(2.3) 



or 



.ADC j^ADC 7.ADC 7.ADC 
load,l=0' ^ sky,?=l' ^ load,I=2' sky,r=3' 



(2.4) 



The switching frequency is fixed by the LFI internal clock at 8192 Hz. The switch clock gives also 
the beat for the ADCs, which are then synchronized with the switching output, and it is sensed by 
the on-board processor, which uses it to reconstruct the ordering of the signals acquired from the 
ADCs and to synchronize it with the on-board time. This frequency also synchronises the ADCs 
with the input and is used by the SPU to reconstruct the ordering of the signals acquired from the 
ADCs and to synchronise them with the on board time. 

The data flow of raw data is equivalent to 5.7 Mbps; a large amount of data that cannot be 
fully downloaded to the ground. The allocated bandwidth for the instrument is equivalent to only 
53.5 kbps including all the ancillary data, less than 1% of the overall data generated by LFI. The 
strategy, adopted to fit into the bandwidth, relies on three on-board processing steps, downsam- 
pling, preprocessing the data to ensure lossless compression, and lossless compression itself. To 
demonstrate these steps, a model of the input signal shall be used. It has to be noted that while 
the compression is lossless, the preprocessing is not, due to the need to rescale the data and convert 
them in integers, (a process named data requantizzation). However, the whole strategy is designed 
to asses a strict control of the way in which lossy operations are done, of the amount of information 
loss in order to asses optimal compression rate with minimal information loss. 

2.1 Signal model 

We describe quantitatively the kind of signal the pipeline has to process by modeUng the output of 
the DAE as a function of time, t, as 



where Tgky, T'load are the constant part of the signal. ATgky, and ATioad a possible deterministic time 
dependent parts, representing drifts, dipoles, oscillations and so on, Wsky and «ioad represents the 
random noise whose moments are cr^ ^j,^, cr^ and whose covariance is cr„^sky,ioa<i- 

The pipeline described in the following sections needs to be tuned to obtain a proper level 
of data compression which is largely determined by the covariance matrix of the signal whose 
components are 



Tsky(t) = Tsky + Arsky(0 + «sky, 
T\oadit) = rioad + Arioad(0+«load- 



(2.5) 

(2.6) 




var[Arsky]+(r^_,ky 
var[Arioad]+o^joad 



(2.7) 



(2.8) 



C sky.load 



COvl ATsky, ATioad I + O"n,sky,load 



(2.9) 
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Figure 2. Schematic representation of the scientific onboard and ground processing for the Planck/LFI. 
Cyan boxes represent REBA operations, yellow boxes ground operations. Green pads specify the parameters 
needed by each operation. TOI could be produced both in undifi^erentiated form (Tsky, Tioad stored separately) 
or in differentiated form. 



where it has been assumed that the random and deterministic parts are uncorrelated. It is useful to 
identify two extreme cases: the data stream is signal dominated, when varl^Arsky] + var[A7'ioad] » 
'^n sky load' '^^^^ Stream is uoisc dominated, when varj^Arsky] + var[A7'ioad] ^ cr^ ^^y 

cr^ load' ^'^ noise dominated case, the statistics of data will be largely determined by the statistics 
of noise, which in general could be considered normally distributed and uncorrelated over short 
time scales, given the l//-noise will introduce correlations over long time scales. In the signal 
dominated case the statics of data will be instead determined by the kind of time dependence in 
the signal. As an example, if jTsky - T'loadI is large compared to the noise while Ars^y and AT^ioad 
are negligible, the histogram of the signals will resemble the sum of two Dirac's delta functions 
6(x - Tsky) + S(x - Tioad) convolved with the distribution of noise. 

If a linear time dependence of the kind AT(t) = Af + C is present, then the distribution of the 
samples will be uniform and bounded between T ±At/2, where r is the time interval relevant for 
the signal sampling. The variance will be A^/12 where Aj = At is the drift amplitude over the time 
scale T. The signal could be considered noise dominated if t < Vl2cr/|A|. From the point of view 
of data compression, in determining whether a signal is noise dominated or not, the critical factor 
is the time scale r. For our coupled signals, denoting with Asky and Aioad the drift rate in the sky 
and reference-load signals, and with Asky,r. ^load.r the relative amplitudes, the relevant components 
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of the covariance matrix will be 




sky,r 



var 



12 



(2.10) 



var[Arioad]r = 



load,r 

"12" 



(2.11) 



cov 




(2.12) 



12 



In this regard, the most important r to be considered in this work is the time span for the chunk of 
data contained in a packet, which is the minimum unit of formatted data sent by the REBA to the 
ground. Each scientific packet produced by the REBA has a maximum size corresponding to 1024 
octects, part of which has to be allocated for headers earring ancillary informations such as the 
kind of data in the packet or the time stamp. So, even taking into account data compression, only a 
small amount of data can be stored in a packet corresponding to about 6-22 sees, which depends 
on details such as the attained compression rate and the frequency channel involved, as will be 
shown in Sect 2.3. More complicated distributions may occur for a polynomial time dependence 
of the kind Ar oc t'\ or for a sinusoidal time dependence of period P: AT oc sinilnt/P), but in most 
cases a simple linear drift AT oc t could be taken as a reference model given that non periodic drifts 
are bounded in amplitude by corrective actions commanded from the ground station, while periodic 
variations have periods much longer than the time span of a packet. Also in general it is assumed 
that the mean mean|^Arsky] = and the mean[A7'ioad] = but it is interesting to discuss even the 
case in which this is not strictly true. 

2.2 Data compression and on-board processing 

The strategy adopted to remain inside the downlink bandwidth is based on three processing steps: 
i) signal downsampling, ii) signal conditioning and entropy reduction, iii) loss-less compression 
[Bersanelli et al. (2009), Miccolis (2003)]. A schematic representation of the sequence in which 
these steps are applied on-board and whenever possible reversed on-ground is given in Fig. 2. 
The figure refers to a single radiometer chain and is ideally splitted into two parts: the upper part 
depicts the on-board processing with cyan boxes denoting the main steps. The corresponding on- 
ground processing is depicted in the lower part with the main steps coloured in yellow. Green pads 
represents the processing parameters. The first four of them are refered to as REBA parameters, 
and they are applied both on-board and on-ground. The parameters are: the number of ADC raw 
samples to be coadded to form an instrumental sample, A^aver, the two mixing parameters ri, r2, the 
offset O to be added to data after mixing and prior to requantization, and the requantization step q. 
The exact meaning of each of these parameters will be explained later in the text, when each step 
will be explained in full detail. It is important to recall that the on-board parameters are imposed 
by telecommands sent from the ground. They are copied in each packet earring scientific data and 
on-ground they are recovered from the packets to be applied by the on-ground processing. The r 
factor is a parameter of the ground processing and is computed from the total power data received 
on the ground. The final products in the form of Time Ordered Data (TOI) either in total power or 
differentiated are stored in an archive represented by the light-blue cylinder. 
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Before entering into the details of the various steps it has to be noted that in principle a factor 
of two compression would be immediately gained by directly computing the difference between 
sky and reference-load on-board, i.e. sending differentiated data at Earth. Although on-board 
differentiation seems straightforward ^, it implies at least a couple of major disadvantages. First, 
once the difference is made, separate information about the sky and the reference-load is lost, 
preventing an efficient detection and removal of other many second order systematics. Second a 
set of 44 r factors could be in principle easily uploaded on-board and applied to the data, but the r 
for each detector has to be fine-tuned on the real data. This would mean that the optimal r should 
be continuously monitored and adjusted to avoid uncontrolled drifts for each radiometer, but this 
is inpractical, having just 3 hours of connection per day. In addition, an error in calibrating the r 
will cause an irremediable loss of data. Therefore, the best solution is to downlink the sky and the 
reference-load samples separately allowing the application on the ground of the optimal r. 

2.3 Downsampling 

Each sky sample contains the sky signal integrated over a sky area as wide as the beam, but since 
each radiometer is sampled at a frequency of 8192 Hz the sky is sampled at an apparent resolution 
of about 1/2 arcsec. On the other hand the beam size for each radiometer goes from 14 arcmin 
for the 70 GHz to 33 arcmin for the 30 GHz. Consequently it is possible to co-add A^aver, consec- 
utive samples producing averaged samples whose sampling time correspond to a more reasonable 
resolution without any loss of information. 

1 . The downsampling algorithm takes A'aver couples of sky — reference-load (reference-load- 
-sky) samples from a given ADC; 2. separates the two subsets of signals; 3. computes the sum 
of sky and load subsets (represented by 32-bits signed integers); 4. interlaces them; and 5. stores 
them as sky — reference-load (reference-load — sky) couples in an circular buffer for subsequent 
processing. In normal processing the REBA converts these sums into averages by converting them 
into floating-point format and then dividing them by A/aver prior to perfom the subsequent steps 
of mixing, requantization and compression. In the case of diagnostic data processing the REBA 
transfers directly as output these sums as they are i.e. without any other processing or compression. 
In this case the ground-segment pipeline has the task of converting them into averages. This is a 
trade-off between the need for packets to carry just data represented by 16 or 32 bits integers, and 
the need to avoind uncontrolled round-off errors in the conversion of floating-point averages in 
integer values. Note that the diagnostic telemetry is very limited in flight by telemetry bandwith. 

The value of A/aver depends on the beam-width, b^^^, for the given detector 

_ a>spin«overSin^ 
^aver ~ r (^-J^j) 

'^rad/sampling 

Wspin [rad/sec] is the rate at which the satellite spins about its spin axis [The Planck Bluee Book (2005), 
Dupac, Tauber (2005), Maris et al. (2005)], (5 is the boresight angle between the telescope line-of- 
sight and the spin axis, and «ovei = 3 is the the number of samples per beam. Nominal values for 
the A^aver ^rc 126, 88, 53 respectively for the 30 GHz, 44 GHz and 70 GHz frequency channels. 
The corresponding sampling frequencies in the sky are then 65 Hz, 93.1 Hz and 154.6 Hz, while 

^This was the baseline of the on-board processing for [Maris et al. (2000), Maris et al. (2004)]. 
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samples are produced at a rate twice the sampling frequency. This drastically reduces the data rate 
that becomes about 85 kbps without introducing an important loss in scientific information. 

The output of the downsampling stage can be seen as a sequence of sky — reference-load 
couples ordered according to the generation time t 



sky j 



sky 



{ T \ 

sky 

^ T'load J^^Q V ^load J^^^^ \ T'load 



■-26, 



^sky 
T'load 



t=nS, 



where 6t = ZA'aver //sampling the samples are interlaced to generate a string of time ordered samples 
as 

^sky 0' ^load,0' ^sky ,5^5 T'load.S,' ^sky 25,' ^load,2(5,> • ■ • ^ ^sky ,,5^; 7'load,H(5,5 • ■ • 

in a manner similar to the output of the ADC. But, while sky and reference-load samples in each 
ADC output buffer are consecutive in time, this is no longer true for the downsampled values. As 
an example, assuming a sequence from the ADC where even samples are T^'^ and odd samples 
are (i.e., Tsky, Tioad sequences), then any Tsky, will be the sum of A^aver samples with times 
between t and f + 2(A'aver- l)//sampiing while and Tioadr will span the time range t+ 1 //sampling and 
f + 2A^aver //sampling- While this Small time shift is not very important when observing sky sources, 
it might be relevant when attempting to correlate the observed signal with internal sensors, such as 
those used to determine the level of perturbation introduced by the active cooling. However, this 
problem is probably more theoretical than real as internal temperature variations do not occur on 
very small timescales. For simplicity, in the remainder of the text we will omit to specify time in 
our formulas. 

2.4 Lossless compression, packeting and processing error 

To better understand the intermediate step of processing, i.e. mixing and requantization, it is nec- 
essary to introduce here the last step of lossless compression. The familiar technique of lossless 
compression is based on the ability of the compression software to recode a stream of symbols 
by using codewords which on average are shorther than those used in input, and in a way which 
could be fully recovered on ground by a decompression code. The coding for the data stream in 
output to the compressor has to be optimized by taking into account the statistical distributions of 
the symbols in the input data stream. For this reason, lossless compressors maintain an internal 
representation of the data distribution, such as the histogram or similar statistical indicators. In our 
case the selected compression scheme is based on a 16-bit, zero order, adaptive arithmetic entropy 
encoder [Herreros et al. (2009)]. The compressor assumes that the data stream is represented by 
an uninterrupted list of couples of 16-bit integers. It does not take any particular interpretation 
of the content of the samples or of the order in which they are presented. It simply keeps coding 
and storing data in the packet until the maximum length is reached. The packet is then closed and 
a new packet is opened. The compressor uses an adaptive scheme to decide the best coding for 
the input data as they are produced by the previous steps of the on-board pipeline. On ground the 
decompressor extracts the samples from each packet in the same order in which they have been 
introduced by the compressor. In this sense the compressor/decompressor couple act as a First In - 
First Out device, and becomes nearly transparent in the scientific processing of the data. 
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The basic requirement for the packets produced by the compression stage is that of packet 
independency i.e. it must be possible to interpret the content of each packet independently of all 
the others. For LFI it means that the pipeline in the ground segment shall be allowed to generate 
from each single packet chunks of differentiated data. So the compressor must store consecutive 
couples of sky — ^reference-load samples within each packet together with the information needed 
by the decompressor to interpret the compressed packets. In addition the compressor must be able 
to self-adapt its coding scheme to the statistics of the input signal, without the need of any prior 
information on it. Finally, the compressor must be fast enough to allow real-time elaboration of 
data with hmited memory consumption. These requirements suggest the use of a compression 
scheme in which the compressor updates its internal statistical table each time it receives a sample. 
An empty statistical table is then imposed at the begiiming of the compression of a new packet, 
therefore assuring complete independence. When a symbol not present in the table is received as 
input a pseudo-symbol corresponding to a "stop message" is issued, followed by the uncompressed 
new symbol, after that the internal statistical table is updated. If the symbol is in the table, the 
corresponding entry is updated and the symbol is coded accordingly. On ground the decompressor 
starts with the same empty internal statistical representation assuming the first symbol is a stop 
followed by a new symbol, and it updates the table accordingly as it receives symbols to decode or 
stop symbols. 

The efficiency of a compressor is typically measured by the, so called, compression rate Cr 
defined as the ratio between the length of an output string Lout derived from the compression of an 
input string of length Ljn 



Of course, to accomodate a given data rate i?data inside a given bandwidth Bdata a target compression 
rate has to be obtained leading to the obvious definition 



It is well known that any lossless compressor based on entropy encoding has an upper hmit for the 
highest compression rate 



where Ambits is the number of bits used for coding the samples and H is Shannon's entropy for 
the signal, which in turns depends on its probability distribution function (PDF). For an optimal 
compressor the theoretical C^^ for a digitized signal represented by integers in the range Q mm < 
Q < Qmax is given by 




(2.14) 




(2.15) 



r 



.Th 



Ambits 

H 



(2.17) 




(2.18) 



e=emi„ 
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where H is the Shannon entropy for the data stream, fg is the frequency by which the symbol 
or value Q occurs in the data stream, having lim/g_>o/2log2/e = 0, and lig™g . /g = 1- Non- 
idealities in the signal and in the compressor cause the effective to be different from the expected 
C^"" having C^"" > C^. Usually this is accounted for by scaling C^^ by a multiplicative efficiency 
factor Tj. However its exact determination is a complex task described in some detail in Sect. 3.4 
and for the time being we will neglect it. 

From Eq. (2. 17) and (2.18), to maximize we need to minimize H for the input signal, forcing 
the reduction of its variance by requantizing the data. I.e. dividing the data by a quantization step, 
q, and rounding off the result to the nearest integer 

Q= round l^i^j, (2.19) 
where O is an additive constant usually defined by asking 

mean[Z + C)] =0. (2.20) 
On ground the data are then decompressed and reconstructed by multiplying them by q. 

X = q[Q-Ol (2.21) 

Some information is lost in the process and causes a processing distortion, eq, which in the simplest 
case is approximated by 

e„ = rms[l - zl « . (2.22) 
L J Vl2 

In [Maris et al. (2000), Maris et al. (2004)] we studied the case X = AT, there it had been shown 
that for Planck/LFI the statistics of the differentiated data stream was approximated by a nearly 
univariate normal distribution with cr = rms[A7'], and that after re quantization and reconstruction 
both Cj^ and where largely parameterized by the cr/q ratio with 



Th ^ f'bit^ ■ (2.23) 

log2(V2^f) 



1 (cr 



-1 



(2.24) 



of course we need to assure that e^/cr < 1 which is expected to be satisfied "by design" for 
Planck/LFI. In this regard, it has to be recalled how the limit to any instrumental residual error for 
Planck/LFI was assessed in the context of the overall error budget (including thermal, radiometric, 
optical and data-handling effects), driven by the ultimate requirement of a cumulative systematic 
error per pixel smaller than 3 //K (peak-to-peak) at the end of the mission. The 10% limit for the 
on-board processing-related errors has been set as a reachable requirement which should lead to a 
nearly-negligible impact on science. 

We are now in the position of deriving the expected time spans for the compressed chunks 
of data contained in Planck/LFI packets which have been reported at the end of Sect. 2.1. It 
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is sufficient to consider that each packet may carry a maximum number Mmp of Ambits code-words 
representing scientific data, each representing on average cj^^ samples either sky or reference-load 
and that the sampling period after the downsampling is /sampling /^aver to obtain 

^^'aver'''smp>-r 

Tpck ~ 7 , 

/sampling 

where the factor of 2 in front of A^aver comes from the fact that the sky-reference-load cycle has 
half the frequency of the ADC sampling. For Planck/LFI Mmp = 490, while a cj^^ = 2.4 would 
be sufficient to allow proper data compression. Of course, some level of variability among the 
detectors has to be allowed in order to cope with non stationarities in the time series, or with the 
need to share the bandwidth among different detectors in different manners. So a good fiducial 
range of values for cj°^ for individual detectors is 2 < C^^' < 3, leading the expected values for 
Tpck to vary over 15 - 22 sec, 10 - 16 sec and 6-9 sec respectively for the 30 GHz, 44 GHz and 
70 GHz frequency channels. 

2.5 The mixing algoritlim 

In general a data stream made of alternate sky and reference-load samples can not be approxi- 
mated by a normal, univariate distribution. Two different populations of samples, with different 
statistical properties are mixed together. In this case the Cr could be reduced with respect to the 
univariate case. Furthermore, most of the first order instabilities, such as drifts and 1 //-noise, 
come from the radiometers, produces spurious correlated signals in Tsky and Tioad- For these rea- 
sons undifferentiated time-lines for Tsky and Tioad are much more unstable than the corresponding 
Ar timelines further reducing C^. In particular, fast drifts may rapidly force the compressor to 
saturate the packet filling it with the decoding information, in the worst case resulting in Cr < 1 . 
According to Eq. (2.23) It is possible to increase q to keep Cr within safe limits, but the log2 depen- 
dence will drive eq/cr to rapidly grow towards e^/cr;l. Alternatively, a more complex compression 
scheme could be implemented, which takes into account the sky-reference load correlation. But 
this would be computationally demanding and would increase the amount of decoding information 
to be placed in each packet. 

One is left with the need to recover the advantage of differentiated data, i.e. reduced instabil- 
ities and more homogeneous statistics, without losing the opportunity to have sky and reference- 
load separately on ground. The adopted solution is inspired by the principle of the pseudo- 
correlation receiver. Instead of sending to ground (T^sky, Tioad) couples, LFI delivers {Ti , T2) couples 
where each Ti, T2 is an independent linear combination of the corresponding Tsky and Tioad- Cou- 
ples are then quantized and compressed. On ground data are decompressed, dequantized recovering 
the original data [Miccolis et al. (2003)]. The most general formula for the linear combinations is 



^ 1 



Ml. sky Ml, load 
M2.sky M2,load 



7'sky 
^ ^load 



(2.26) 



here the matrix, M, in Eq. (2.26) is named mixing matrix (actually it represents a mixing and a 
scaling unless |M| = 1), its inverse is the corresponding de-mixing matrix. The demixing 
matrix is applied on ground to recover the string of (T^sky, ^load) out of the received string of (ri , T2), 
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which imposes |M| + 0. The structure of M determines the kind of coding strategy. A particular 
structure for M could better fit a given subset of constrains rather than another. Both the Cr and eq 
are determined by q as well as M. In particular it is obvious that the processing distortion will have 
the tendency to diverge for a nearly singular M. A detailed analysis of the whole set of possible 
structures for M is outside the scope of this paper, but in general M shall be optimized in order to 
i) equalize as much as possible the T\ and T2 statistics, ii) reduce as much as possible the efi"ects of 
first-order drifts, iii) maximize the Cr, iv) minimize eq. For Planck/LFI the following form for M 
has been selected. 



M = 



(2.27) 



1, -r2j' 

|M|=r2-ri; (2.28) 



1 ( 
M-i = — 



ri-r\ 



ri, -n 
1, -1 



(2.29) 



which is not completely optimal, since it allows optimization only on a subset of possible cases, but 
has the advantage of having a reduced amount of free parameters to be uploaded for each detector ^ 
and it is directly suggested by Eq. (2. 1). Since |M| = r2 - rj , for any given q the distortion increases 
when \r2 - ri\. In nominal conditions mean|^rsky] = 2.735 K, mean[rioad] = 4 K, and a possible 
choice for ri and r2 is ri = 1, r2 = r = 0.85. But a tuning procedure is required to determine the 
best parameters for each radiometer. 

The effect of mixing with respect to both signals and distributions is illustrated in Fig. 3 which 
refers to the case of a data-stream which is signal dominated (see page 5). In this figure dashed- 
lines represent the input signals, full-lines the corresponding mixed signals, dotted-lines the limits 
for the variability induced by the noise. So the ramp in frame a) of Fig. 3 represents a model 
signal for 7'sky(f) (blue dashed line) and rioad(0 (red dashed line). The corresponding interlaced 
data are shown in Fig. 3b (blue dashed-line), where the limits of variability induced by noise 
are not represented in order to avoid confusion. The time lines for Ti{t) and T2(t) calculated for 
rj = 3/4, r2 = 1/2, are represented in Fig. 3a and Fig. 3b as full-lines, and they are shifted to 
avoid overlapping with the previous plots. The reduction in the variance is associated with drifts in 
the mixed data is evident. Mixing transforms the bi-variate PDF which is for Tsky, Tioad signals 
into that for Ti, T2. Fig. 3c represents its effect on the bi-variate PDF for the noise and the drift. 
Looking at the normal distributions of randomly variable signals in Tsky and Tioad, g(7'sky,7'ioad), 
the line for which g(7'sky,7'ioad)/g(7'sky.7'ioad) = 1/2 is a dashed line, and the equivalent line for 
g{T\,T2) is a full-line. The distribution for the deterministic signal (either a ramp, a drift or a 
triangular wave) is represented by a segment, plotted again as a dashed line to denote the Tsky, Tioad 
signal and as a full line to denote the T\,T2. The effect of mixing is a combination of a non-uniform 
scaling, a rotation and a shift. The circle transforms into an ellipse. The line changes its tilt and 
length. In other terms, the covariance matrix for mixed data will be different from the original ones. 
A very interesting consequence is in the case of a normally distributed noise a correlated noise will 



^Packets independency imposes that all the free parameters {N-Ma, q, O, r\ and r2) have to be stored within each 
packet. 
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a) b) 



Interlaced Time Series 




Figure 3. The effect of mixing and interlacing on time series (top frames) and distributions (low frames), in 
the output to the radiometer. Frame a) is an input time series, for sky (red dashed-line) and reference-load 
(blue dashed-line), and the corresponding mixed quantities Qi (red full-line), Q2 (blu full-line) after mixing 
with ri = 3/4, /•2 = 1 /2. The range of values allowed by noise within ±5cr are represented in both mixed and 
not mixed quantities by the upper and lower dotted lines. The input time series has identical drifts on sky and 
reference-load equivalent to several noise crs and corresponds to a signal dominated time serie as defined in 
Sect. 2. 1 . Frame b) are the signals as seen from the compressor after interlacing, full-line without mixing, 
dashed-line after mixing. Frame c) is the effect on a normal distribution and on a ramp, green before mixing 
and red after mixing. Frame d) shows the effect on the projected distributions after interlacing. 



appear in the mixed space even if the input noise is not correlated. In general after mixing the 
major axis of the two figures has the tendency to align with the x = y line and the center of the two 
figures shifts. In this case |M| = r2 - ri = I and the size of the two figures changes proportionally to 
|M|. Interlacing transforms the bi-variate PDFs into univariate ones. Fig. 3d represents the effect 
of mixing on the PDF of interlaced data. Again, dashed lines represents the distributions before 
mixing and the full-lines after mixing. As in Fig. 3b red indicates Tioad or Ti and blue indicates 
Tsky or 7^2. The bottom part of Fig. 3d represents the resulting distribution of interlaced signals, 
before (dashed) and after (full) the mixing. 

How these distributions have to be decomposed in terms of the projected distributions is shown 



-14- 



in the top part of Fig. 3d, which shows separately the distributions of the random and deterministic 
components, respectively a normal and a box distributions for Tsky and Tioad, The resulting distri- 
bution will be the convolution of the two, for T^ky and Tioad are very similar to a box distribution. 
Of course the drift makes the overall signal non Gaussian, in particular for Tioad- The central part 
of Fig. 3d is the equivalent for the mixed signal. Here the drift is reduced and the convoluted sig- 
nals are more similar to the original normal distributions of noises. I.e. mixing not only reduces 
the distance between the two components but, by reducing the drift, make them more normally 
distributed. 

After mixing, the {T\,T2) couples are re-quantized produce the quantized couples (Qi,Q2) 
which are interlaced and sent to the compressor 

Qi = round|^^^J, i = 1,2; (2.30) 

where (9 is an offset introduced to force (Qi, Q2) to stay within the range [-2'^, +2'^] ^. On ground, 
packets are entropy decoded, the data streams are de-interlaced and the corresponding iQi,Q2) are 
used to reconstruct the sky and reference-load samples 

fcr=Yj^a!iiq'^Qi-'->^^^ a = sky,load, (2.32) 

(=1,2 

where are the components of M~\ and the tilde over a symbol "x" is used to distinguish a 
reconstructed quantity out of a processed one x. 

Mixing will map sky — reference-load statistics in the corresponding mixed statistics 



Ti = Tsky -r,Tioad, i = 1,2; (2.33) 
A^i = A^sky - r,Arioad, i = 1,2; (2.34) 

O"? = O'sky + '■fo'Ld - 2riCr,ky,load; (2.35) 

2 2 2 ''1 + '"2 o^N 
0"l,2 = O'sky + '•20-ioad ^0"sky,load- (2.36) 

A simplification for the components of the covariance matrix can be obtained by assuming cTsky 
as unitary, then defining ta = T„/crsky, a = sky, load; and incorporating Ru- = cTsky/crioad in the r, 
factors. This gives these normalized parameters 



Af = f,ky-rfioad (2.37) 
d-j = l+rf-lriQsi; (2.38) 

d-^j = l+rirz-^^^fti- (2.39) 



''As anticipated in Sect. 2.3, to reduce the roundoflf error, the division by A'avei- is applied generating (Q\,Q2), in 
addition the parameter for digitization is not q but Sq = 1 Iq, so that Eq. (2.30) shall be written 

IS„iTi+0)\ 

e, = roundP4^ ,< = 1,2; (2.31) 

however for consistency with [Maris et al. (2000), Maris et al. (2004)] in the following we will omit the division by 
Waver and we will continue to use q in place of 5q. 
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where g^i = crsky,ioad/(o"skyO"ioad)> = rJRcr. The corresponding transforms for the expectations is 
more complex. Of course Ta = = sky, load, but 

'-'sky 



fi = f,ky-i?,rr/fio,d, i = l,2; (2.40) 
Afi = Afsky -^crr,Afioad, « = 1,2. (2.41) 

(2.42) 

However these normahzations are very useful in discussing the compression rate, especially after 
having defined the obvious q = q/os^y. 

2.6 Modelling the statistical distribution of processed data 

We want here to define an approximation able to asses Cr > Cj^ in a simple way. For this reason 
we need to model the entropy for the signal entering the compressor. Of course the accuracy to 
which it is possible to predict the final Cr is directly connected to the accuracy to which the entropy 
is predicted. In the following we present two approximations for the entropy of the signal, a lower 
accuracy approximation and a high accuracy approximation. 

2.6.1 The low accuracy approximation 

Considering the usual reference cases of a noise dominated signal and a signal dominated by a 
linear drift, in the first case the PDF can be approximated by a normal distribution, in the second 
case the PDF can be approximated by a uniform distribution with fg = q/A values and cr = A/ Vl2. 
But in any case for cr/q » 1 

i/ = log2fepdf-, (2.43) 

with fcpdf a constant depending on the type of p.d.f. ranging from VT2 for a uniform distribution to 
y/lne for a normal distribution. The difference in H between these two extreme cases is 0.25 bits. 
The argument of the logarithm is the number of symbols in the distribution. So H may be written 
also as H = log2A^s^^ with N^^^^ = k^alq. Of course in the case of the uniform distribution 
^s^b ~ ^symb- The PDF for the interlaced signals gives the probabiUty to have a symbol Q either 
from processes Q\ or Q2. Then 

'Pi{Q)+'P2iQ) 

nO) = \ (2.44) 

with fiiQ), i = 1, 2 the marginal PDF for the Q, drawn from the bi-variate PDF f (Qi , Q2). For 
our extreme cases both !P,(0 are uniformly distributed or normally distributed according to the 
original PDF from which they are drawn ^. This allows one to neglect, in estimating the entropy 
of the interlace signal, their mutual correlation. Then the entropy for the interlaced data is just a 
function of the RMS for the two distributions cri, 0-2, and their separation, Adistr, 



'In this case the central limit theorem does not apply to the signal with a uniform PDF given its deterministic nature. 
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2 Effzl-Effil 
Adistr = ^ ^ (2.45) 

Kpdf O"! + 0"2 

which is a normalized measure of the distance between the two peaks. After some algebra 

Adistr = 2^^^ (2.46) 

Kpdf 0-1+0-2 

Then the entropy will be just a function of &{, 0-2 and Adistr. An exact analytical expression 
for H can not be obtained for this case. However, it is easy to see that in the limit |Adistrl » 1 the 
entropy takes the limiting value 

/f. = ^^ + l, (2.47) 

giving 

/fco = log2(^pdf ) + log2( y/d-id-2) - log2 q+l- (2.48) 

On the other side if Adistr = and (Ti = 0-2 the two PDFs collapse giving Hq = Hi = H2- In all the 
other cases Hq < /f(Adistr) ^ ^^c». The important point here is the assumption that 

H ^ H^, (2.49) 

would never overestimate the entropy by more than 1 bit or ^ 30%. Therefore neglecting the 
compressor inefhciencies, a sufficient condition to asses > Q would be 



with/ftgt=A^bits/cJ^',or 



H^ < /ftet (2.50) 



^Jd-id-2<q—, ; (2.51) 

^Kpdf 



and so 



j 2™''"" r 

m+f^^- 2ri£)si)(l +rl- 2r2£)si) < q—r. ; (2.52) 

^ 2A:pdf 

Eq. (2.49), Eq. (2.50), Eq. (2.51) and Eq. (2.52) represent our low-order approximation for the 
optimization of REBA parameters. In particular, Eq. (2.52) puts a lower limit to q (and q) for a 
given Cr. In fact for fi = f2 = 0, q must be larger or equal to 

qr:.niC?') = ■ (2.53) 

As shown in Fig. 4 for Qsi = 0, the left side of Eq. (2.52) in the (ri,r2) plane has a minimum 
at (0,0). Its iso-contour lines are closed, all centered on the origin, having four axis of symmetry 
ri = 0, r2 = ±ri, and r2 = 0. The maximum distance from the origin of iso-contour lines occurs for 
rj = or ^2 = and the minimum occurs along the f2 = +ri line. Changing g^i = toward negative 
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Figure 4. Iso-countour lines of the left side of Eq. (2.52) for psi = -1 (left), Qsi = (center) and gsi = 1 
(right). In the first case the function is minimal in the (-1,-1) point, in the second in the (0,0) point, while 
in the third in the (1,1) point. 

or positive values the iso-contour lines are again closed but their symmetry changes taking a more 
"cuspidal" shape, which is symmetrical about the fi = g^i and the r2 = Qsi lines. In any case, the 
value of the function decreases near the r2 = ri= Qsi point where it has a minimum. From the figure 
it is evident that when converting (h,r2) to (ri,r2) a larger q/qmm ratio or a smaller crioad/csky 
increases the size of the region enclosed by each contour. Eq. (2.52) and Eq. (2.53) define in this 
low order approximation the optimal q for which Cr = C^^' 



from which ^opt is simply derived as ^opt - o"sky§opt- This equation does not constrain completely 
^opt and for this reason we have to take into account the processing error as explained in Sect. 2.7. 

2.6.2 The high order accuracy approximation 

The accuracy by which ^opt is determined by Eq. (2.54) is solely determined by the accuracy of 
imposing H = Hoc- Given the statistics of the input signal it would be not a problem to calculate by 
numerical integration /f as a function of r\ , r2 and q. But of course this would be quite expensive 
from a computational point of view. For this reason a high accuracy algorithm to compute H is 
derived Appendix A using simple equations from which ^opt could be readily obtained. However 
from the conceptual point of view the high accuracy method does not introduce any new detail 
in the discussion, and therefore the remaining part of this section refers only to the low-accuracy 
method unless otherwise stated. 

2.7 Processing error of the mixing/demixing algorithm 

The most important way to quantify the processing error is the measure of the distortion in the 
undiff"erentiated or difl'erentiated data. The statistics of such distortions are taken as metrics of the 
quality of the process. For the undifferentiated data 




(2.54) 




(2.55) 
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a = sky, load. By following the methods of [Maris et al. (2004)] from Eq. (2.32) and Eq. (2.55) it 
is easy to derive the covariance matrix of the quantization error, Eq,afi = cov[^5„,(5ygj 



E ^r_}^(r-{ + 4,ri+r2 
I2(r2-n)^[ri+r2, 2 

The distortion of differentiated data is instead expressed by 



.2 1 t ^2 , a 

f 1 -t- f 1 

(2.56) 



^diff = (Tsky - fTioad) - (T'sky " ''T'load); (2.57) 

where f is the r determined on the processed data, which in general will be slightly different from 
the r determined on the original ones. However, assuming r ^ r from Eq. (2.56) the variance of 
Sm is 

The first important fact which has to be stressed is that the variances of both errors are pro- 
portional to I{r2 -r\f-. Of course a nearly singular matrix with ~ ri will result in very large 
errors. In addition, Eq. (2.56) shows that, despite quantization errors for Q\ and Q2 are uncorre- 
lated, application of demixing causes processing errors in T^-^y and Tioad to be correlated unless 



ri+r2=0. (2.59) 

However, expanding the numerator of Eq. (2.58) produces e^^:^^ocr^ + r^ + 2r^ -2r{r\ +r2) suggest- 
ing the important result that a not null correlation in the quantization errors may lead to a reduction 
of the error distortion in the differentiated data 

Another very important case is ri = r or r2 = r. In this case Eq. (2.58) reduces to 




(2.60) 



which is the same result we would have got quantizing differentiated data ([Maris et al. (2004), 
see]). This fact has been used in the first version of the optimization software, designed for the 
first run of the ground tests (the RAA tests described in [Bersanelli et al. (2009)]) to increase its 
speed, together with the fact that and €g are not sensitive to an interchange of ri and r2 but rather 
to \r2 -ri\. However, in the subsequent tests the more general and accurate procedure described 
here has been successfully applied. 

2.8 Saturation 

Saturation occurs when the argument of the y = round[x] function exceeds the maximum range of 
values allowed by the computer to represent the results. Indeed y = round[x] returns an Ambits signed 
integer. If \x\ > 2'^''"^^- an overflow or an underflow will occur. Depending on the implementation of 
the y = round[x] function the value of y could be either forced to be ±2^^ with the sign depending on 
X, or modular arithmetic could be applied so that as an example a too large x > could be mapped 
into y < 0. In all cases the whole subsequent reconstruction will produce meaningless results. So it 
is fundamental to avoid saturation. 
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a) 



b) 



SKy(0,1,0);Load(0.1,0) 



Sky(0,1,0); Load(0,2,0) 




c) 



Sky(-2,1,D);Load(D,1,0) 




d) 
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Figure 5. Example of analysis of Q^cY factors in the (r\ ,r2 - r\) space. Various cases for different values 
of Tsky and Tioad are considered. The yellow region is the allowed region when all the (3ack,( < 1, the blue 
region is the forbidden one. Thin dashed lines are the limits of allowed regions (3ack,i with positive (red) 
or negative (yellow) drifts and Qack,2 for positive (green) or negative (violet) drifts. Thin full lines are the 
allowed regions for (3ack,i and Qack,2 whose intersection is marked with a white thick line. Values of Tsky, 
cTsky, ATsky and the corresponding for Tioad are between parentesis in the title of each frame in the order 
(mean, sigma, drift). 



The level of filling of the allowed dynamical range is measured by the instantaneous Qack ratio 



Saturation occurs if at some time |Qack,/(OI ^ 1 and the non-saturation condition is 



(2.61) 



iack,i(f)l<iAiack,2(0l<i; Vf 



(2.62) 



^From Quantization Alarm Check. 
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Sky(-2,1,D);Load(2,1,0) 



Sky(-20,1,25);Load(20,1.25! 
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c) 



Sky(-20,1,D);Load(2D.1,25) 



d) 




Sky(-20,1,25); Load [20,1 ,-25) 



r 
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Figure 6. Example of analysis of (Sack factors. See caption of Fig. 5 for explanation of symbols. 

in general this will put limits on ri, ^2, q and O. Assuming to have applied the optimized offset of 
Eq. (3.1) the linear combinations are 



<3ack,l(0 = 



<aaek,2(0 = 



ArskyCO - ri ArioadCO + 7' load ± n(Tn,\ 

Arsky(0 - r2Arioad(0 - ^^T'load ± ncr«,2 



(2.63) 
(2.64) 



where fi ^ 5 is used to assess a safety region against random fluctuations. 

In computing Qack the eff'ect of mutual cancellation of extremal values must be considered. A 
conservative estimate would be to propagate the modulus of each variation 



max(A7',) = max |Arsky| + |r,| max [Arioadl + 



l^ioadl + no";, 



(2.65) 



with min(A7',) = -max(Ar,), but it is better to explore the various combinations of minima and 
maxima within Eq. (2.63), producing a set of partial Qack indexes which have to be independently 
satisfied. An example of such method is illustrated in Fig. 5 and Fig. 6. 
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In general, the separation between Ti and T2 is a function of time, whose measure is given by 
the divergence VT, a parameter just sensitive to AT^ioad and Tioad 

VT{t) = -2 ; — (2.66) 

Kpdf (o"i + 0-2) 

of course |Vr| will be constant when A^ioad = 0. 

To determine the region of parameter space ri, r2 which satisfies Eq. (2.62) it is most conve- 
nient to work in the (ri , r2 - ri ) space, there the most general condition is 



ri(a-l)-b <r2-ri< ri{a-\)-c\ (2.67) 
c-(r2- n){a - 1) < (fl - l)ri <b-{r2- n){a - 1); (2.68) 



with the dimensionless coefficients 

^ /2A7Vd^jV ^2.69) 

\ ^load / 

bi = J— (Arsky + g2^^«^-' ± no) ■ (2.70) 

T'load 

c± = (Ar^ky - ^2^^--i ± ncr) ; (2.71) 

7'load 

cr = max(cri,cr2); (2.72) 
For n = those conditions define a diamond-shaped region whose vertices are 

A: (£±^,-^), B: f^,o), 

V a(a-l) ' o /' Vfl-l' /' 

V a(fl-l) ' /' Vfl-l' /' 

Note that B : and D : lie on r2 - ri =0 line, while A : and D : are above and below it, the exact 
ordering depending on the signs. The center of the diamond-shaped region is locate on ({b + 
c)l(a- 1),0). If + c = the region is centered on the origin of the Cartesian system, A :, C : 
and fi :, Z) : are mutually opposed. In the case a = 1 the region degenerates into a band parallel to 
r2-r\ = 0, and bounded by -b <r2-r\ < -c. 

First considering the stationary case A^sky = 0, Arjoad = 0, then a = 1, i> = -c the constraints for 
M = 0. In this case Qack,i and Qack,2 defines the same condition 

|r2-ri|^<.yAf.at. (2.73) 

identifying simply a band around the ri = r2 line. The effect of noise is to put n > so that a+ and fl- 
are different and Qack.i defines a vertical band and Qack.2 a diagonal band whose intersection is the 
diamond-shaped region above of Fig 5a. Changing the ratio between crn,2/o'n,i will not change the 
shape but just the size of the region. Fig 5b. Changing Tsky and Tjoad instead will change the shape 
of the allowed region as shown in Fig 5c, Fig 5d and Fig 6a. If Tioad = the saturation condition 
becomes |1 - r,| • |A(OI < ^A^sat i.e.: 1 - ^A^satMmax < r,- < 1 + ^A^satMmax, with A^ax = max(|A(0|). 
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This defines a rectangular region with diagonal ri =r2. In the limiting case for n = 0, Amax ^ the 
allowed region becomes the whole plane, while in the opposite case Aniax — ^ ^ th^ allowed region 
shrinks toward ri = r2 = 1 . 

Perturbations in the sky channel, such as the cosmological dipole, introduce a fluctuation which 
affects just the sky, in this case Arioad = and a = I. Even here the simplest cases are Tioad.o = 0, 
or the forbidden ri = r2. So max(|Z)(f)|) < qN^at is a sufficient condition which puts a limit just on 
q. In the most general case from Eq. (2.67) the limits of the allowed region are -2Arsky/7'ioad,o - 

2qN sat /Tload.O <r2-r\ < -2Arsky/7'ioad,0 + 2^A^sat/7'load,0 

Other effects, such as instabilities of the 4-K reference-load, could affect just Tioad and ^T^^y = 
0, b = 2qNsat/Tioad.o = -c Even here the simplest case Tioad = 0, max(|A(?)|) < qNsat is a sufficient 
condition which puts a limit just on q. In general a diamond-shaped region symmetrical around 
the origin represents the allowed region, as shown in Fig 6c. 

Drifts in the gain of the amplifiers in the radiometers, such as those produced by thermal 
effects, add correlated or anticorrelated signals in sky and reference-load. So the model case to 
be considered is the one in which A(f) ^ ATs^yit) ~ Arioad(0- In this case with a = (b + c)/2 and 
Eq. (2.63) and Eq. (2.66) define the regions in Fig 6b and Fig 6d. 

Before to conclude, it is interesting to consider the range of values assumed by fi and ^2 
when A in the range Aiow <A< A^p then \Pixsp - Pi,Low\ = l^up -^lowl • |1 - r,]. To have identical 
ranges, is ri r2 and r2 + ri = 2 or (r2 - ri) = 2 - 2ri . 

In nominal conditions, Tsky.o ~ 2.73 K, Tioad.o ~ 4 K while Tsky, Tioad fluctuations are expected 
at the level of at most some 10"^ K giving A7'ioad/7'ioad,o ~ Arsky/rioad.o ~ some x 10"^, and hence 
a~l,bx -c. But this condition could be severely violated at ground during the ground tests, or in 
flight while the instrument is cooling down. 

3. Optimizing the On-board Processing 

The optimization of the algorithm consists in determining the "best" combination of the set of 
processing parameters i.e. the "best" n-tuple A^aver, r\, r2, O, or q. It is mandatory that the 
optimization procedure will keep within safe limits Cr = 

The classical approach would require a function of merit and a searching algorithm through 
the corresponding NxR^ parameters space to be applied to each of the 44 detectors. However a 
reduction of the cardinality of space comes from the fact that, by-requirement, the nominal A^aver 
is fixed by the oversampling factor for the beam, so apart from the cases in which a different 
oversampling is required, the A^avei in nominal conditions is fixed. The only cases in which A^aver 
could be varied are: i.) sampling of planets for beam reconstruction; ii.) ground testing and 
diagnostics. The first case occurs when the beam has to be reconstructed with higher detail than the 
one reachable with the nominal oversampling factor Mover = 3. So it is possible to ask the on-board 
processor to decrease A^aver increasing proportionally the data-rate from the feed-horns which will 
be affected by a planet. To arrange the higher-throughput of scientific telemetry, q will have to be 
increased, increasing proportionally e^. In the second case the value of A^avei could be varied either 
to increase the time resolution, as an example if sampling of some perturbation characterized by 
time scales compatible to A^aver //sampling has to be investigated, or if some temporary shortage in 
the telemetry rate is imposed, asking in this case to increase A^avei - Also while testing on ground 
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for long term drifts, the sky is replaced by a dummy load at constant temperature. In this case 
time resolution is no longer an asset and A^avei could be increased. A further reduction of the 
parameter space to be explored comes from the fact that usually O is optimized in order to have 
mean[rinteriaced + <5] = whcrc ^interlaced are the interlaced samples produced after by Eq. (2.27). 
It is then easy to derive that mean[rinteriaced + = mean[ri] +mean[r2] + 20 so that Ooptimai = 
-(mean[ri] +mean[r2])/2, and with some simple algebra 

(^optimal = ~7"sky H T'load; (3.1) 

where the mean has to be computed over a suitable time span. What remains is a ^R.^ parameter 
space to be explored {r\,r2,q). 

3.1 Target function 

The target function ;^^(ri,r2,^) for the optimization would i.) asses Cr = cj^' to be kept within safe 
limits; ii.) asses eq to be kept as small as possible; iii.) asses additive constrains. These constrains 
do not allow a unambiguous definition of a target function. As an example, even for a stationary 
signal dominated by white noise, Cr computed on each packet is a random variable. So the question 
is whether = cj^^ has to be interpreted strictly, i.e. forcing each packet to have Cr = cj^^ or on 
average leaving space for lower and higher Cr? In general it would not be critical if some fraction 
of the packets would be compressed at a rate lower than Cr The requirement on is even worse 
defined. Of which €q are we speaking? As shown in sect. 2.5 it is evident that there is not a general 
definition for e^. Depending on the scope of the data acquisition it could be more interesting to 
have a low for Tsky or Tioad or Ar computed for some reference r. More over neither nor 
fg.diff are functions with a minimum and they vary over the full range of positive values. In addition 
within a pointing period repeated sky samples are acquired. In making maps repeated samples 
are averaged and will be reduced by a factor 1 / [Maris et al. (2004)]. So a relatively high eg 
could be acceptable at the level of single samples when observing stationary sources. However the 
ratio between eg and the noise will not change after averaging. So a convenient choice would be to 
consider eg/cr, where cr could be the RMS of Tsky, Tioad or AT depending on the case. The only 
hard constraint which has to be considered is that saturation must be avoided. 
The general formula for the target function is 

xm = Y\Qc(&f', (3.2) 

c 

is a vector in the parameter space, Qc is a function varying over the range [0, 1] with Qc(@) = 1 
if fits the particular criterion c for which the function is defined, Qc(&) = if does not fit this 
criterion. Intermediate values may be also defined in the [0,1] range measuring the goodness of fit. 
As an example, a criterion for optimal q is to have ydiff = nun(-ydiff), the corresponding criterion 
function is Qc{@) = min (ydiff) /ydiff- The exponents n^. > 0, with Yjc^c = 1, are weights defining 
the relative importance of each criterion within a given policy. In general it is better to have Qc(&) 
functions which are derivable. In some case it is necessary to to deal with poles that have to be 
avoided. A method is to define a metric yUc(0) ^ with a single pole for which jUc(0) — > +°° and 
take Qc{&) = e"^'*®) or Qd®) = 1/(1 +6'-^' (®^). Typical criteria are shown in Tab. I 
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Figure 7. Iso contour lines for the functions entering e^^diff and e^ jiff as a function of (ri, r2). Vertical blue 
dashed lines cri , horizontal blue dashed lines era, black dashed lines (^2 - r\)-^, blue contours (ri - r)^ + (r2 - 
r)^, red dashed contours y 0-10-2, and the red contours are the resulting e^y difF- The width of the lines varies 
with the value of the function. 



Table 1. List of possible criteria 
Condition Criterium 





= min(eg,diff) 


Qc 


= min(eg,diff)/eg,diff 




= min(e^,sky) 


Qc 


= min(eg,sky)/e9,sky 


f^.load 


= min(e,yjoad) 


Qc 


= min(egjoad)/ e^joad 


f^.load 


= e^.diff 


Qc 


= 1 - k^.load - e^.diffl/ niax(egjoad + fg.sky) 






Qc 


= 1 - k^Joad - f^.skyl/ max(e^joad + f^.sky) 


e^.diff 




Qc 


= 1 - k^.diff - fg.skyl/ niax(e^4iff + eg,sky) 



3.2 Analytical Optimization 

Analytical optimization (AO) is based on analytical formulas assuming either normally distributed 
or uniformly distributed signals. As a starting point for more refined numerical optimization. At 
the root of this method of optimization is the requirement of minimizing the processing errors, 
as an example the e^diff- However given they diverge at ri = r2 it is necessary to consider the 
maximization of their inverse normalized to the minimal value, as an example defining for e^^diff 
the function Fdiff = min(e^ diff)/f9,diff- These functions become for fi = ^2, unless either ri = r or 
h = f. 

Again it is convenient to use the normalized parameters r„ in this case the covariance matrix 
of the processing errors is 




b) 




Figure 8. Comparison for e<,,diff = fi/.sky top-left, e^,diff = Q/.ioad top- right, e^,sky = e,^,ioad bottom-left and 
the corresponding best fit regions bottom-right; as a function of (ri, ri). The lines are iso-contours of 
exp(-|.!c|) where x is the difference between the two arguments to be compared. For the bottom-right frame 
the iso-contours are tracked for cubic root of the product of the comparison functions. The simulation is for 
Tsky = 7" load = 0, cTji^y = (Tioad =10 ADU, r = 1. No drifts are included. 



.2 ri+h 



(3.3) 



In the framework of the low-level approximation for ^opt calculation, after replacing Eq. (2.54) 
into the e^,diff from Eq. (2.58), substituting r, — > r,, q ^ q and r = r/Ro-, its reciprocal is 



Tdiff = 



m - r)2 + ih - r)2] ^il + f^^-2fig,i)(l+fl-2f2gsi) 



(3.4) 



Fdiff is symmetrical with respect to the axis ri = r2 and has a maximum where the processing error 
has a minimum. There is no analytical way to maximize Fdiff. However, Fig. (7) shows the contour 
plot for the various components of this function. The denominator is the product of a function 
which is constant over circles centered onfi=r2 = f' (or ''i = ?'2 = '') and which increases with the 
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a) 



b 



m =0,m =0,(! =10,o = 10,r = -lnl 
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C) 



d 



m =0, m =0,0 = 10, a = 10, r = -lnt niin(y )/y 
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Figure 9. Iso contour lines identifiing the regions in the (ri, r2) space where min(e£^_(jiff)/e^_diff top-left, 
min(e^,sky)/e?,sky top-right, min(efy,ioad)/e£/,ioad bottom-left, and where all of the are as near as possible 
at their minima, bottom-right. The lines are isocontours of min(x)/x which is 1 when the minimum is 
reached. For the bottom-right frame the iso-contours are the cubic root of the product of the other frames. 
In the top-left frame white + denotes the positions of the analytical solution, the black thick contour is 
for min(e^.diff)/e^,diif = 0.5, the black thin contour lines are for min(e^_diff)/e(y,difF assuming H = H^a- The 
simulation is for Tsky = T^ioad = 0, (Xsky = crioad = 10 ADU, r = 1. No drifts are included. 



radius, and of yf&\&2 which has a more or less elliptical form and that for psi = is centered on 
ri = ri = 0. The numerator is null on the f \ = r2 line and it is constant over lines parallel to ri = r2 
increasing with distance from that line. Hence the Fdiff maxima must be symmetrically aligned 
along a line normal to ri = f2- The line has to cross the f\ = f2 line at f\=f2 = h, with < rc < r, 
so that the maxima for Fjiff are located at 



h - rc± -^imin, (3.5) 

V2 

h - rc + — pCin; (3.6) 
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where fj^m measures their distance from the ri = f2 line. Numerically it is possible to show that in 
the case q^x = sufficient numerical approximations to {mm and as a functions of r are, 

h ~ 0.6994r + 0.2722 (3.7) 



0, 0<r<0.701 
-3.0836r2+6.7034r + 3.0969, 0.701 <r< 1 (3.8) 
-0.3985r2+0.3367r- 0.4369, l<r<10 

Fig. 9a represents a typical pattern for rdiff(ri,r2) (the r, are converted into r,), and it is assumed 
r = l/Rg-. The optimization produced by maximizing Eq. (3.4) could be improved by using the 
approximation for the entropy in Sect. A which takes into account of possible overlaps between 
the Qi and Q2 distributions, allowing a better approximation to ^opt(''i.''2)- So in the figure Tdis 
have been computed by using the method in Appendix A but black contour lines are those obtained 
assuming H = Hca at the root of Eq. (3.4) it is evident that the two approximations agree quite 
well. Crosses mark the position of maxima calculated with the approximated solution described 
above. The Qack factor for this case does not reveal any saturation. So it is possbile to look for 
other combinations of optimized parameters. As an example: Fig. 9 b) and c) are the equivalent of 
Fdiff computed for e^ ^ky, and e^joad- Of course while Fsky has well defined maxima this is not true 
for Fioad given e^joad has not an upper limit. Fig. 9 d) represents the product of Fdiif and Fs^y. We 
may look at combinations of parameters where, e^.sky = e^joad or e^_sky = e^.diff or e^joad = e^.sky as in 
Fig. 8a,b and c, or e^^iff ^ e^joad ~ e^.sky- Thus the product of the second group of criteria in Tab. 1 
can be used, assuming all the lie = 1/4, as shown in Fig. 8d. 

3.3 Dealing with saturation 

A more complex situation could arise if the selected optimal r\ , r2 and ^opt lead to saturation. In 
this case either a (ri,r2,^opt) far from the F^iff peak has to be selected or ^opt has to be increased 
in order to reduce the corresponding Qack factor. In the first case the requirement Cr = cj^' will be 
assessed but the quantization error will be larger than the optimal one. To limit this error the new 
(ri,r2,^opt) would have to be selected as much as possible along the ridge near the Fjifr peak ad as 
much as possible far from the ri = line. In the second case we consider the fact that Qack 1 
so it is possible to take r\ and r2 at the Fdiff peak but to take 

^opt.Qack = •5ft<3™l^opt('^''^''>'^'^^'') 

where = i^ax(|Q5|^^''j|,|Q5|^^''2l) and > 1 is a safety factor which typically is S%_ = 2. Of 

course in this case the data are compressed at an higher rate than Q " while the processing error 
will be increased by a factor >SftQ™j . 

3.4 OCA2K, non idealities and numerical optimization 

Non-idealities in the signal and in the compressor cause the effective to be difi"erent from the 
expected Cj^, and in general Cj^ > Q. A formal way to account for this is to define a compression 
efficiency tjc^ < 1 defined as: 
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ric, = (3.9) 

which could be decomposend in the product of the contributions of each non-ideality. In general 
it is very difficult to account in a satisfactory way for even the most important non idealities as is 
illustrated by the following examples. 

A group of non-idealities comes from the fact that each time a new symbol is discovered in 
the data stream the compressor adds at the compressed output a "stop" pseudo-symbol followed 
by the uncompressed symbol. Then the compressor is coding the symbols in the input data stream 
plus the "stop" pseudo-symbol and consequently the entropy for the compressed data stream to be 
introduced into Eq. (2.17) is changed by a factor ?7~^p 

_1 <^stop , ^Pstop l+log2(l+^stop) ^„ 

^ l+^Cstop 1+Vstop //(l+V?stop) 

where (^stop - -'Vsymb /"samples. -'Vsymb IS the number of different symbols in the packet, njan^pies the 
number of samples stored in the packet and H comes from Eq. (2.17). Note that for ^stop — > 0, 
'7stop — > 1 ■ In general for small ^stop the addition of stopping symbols increases the entropy leading 
to ?/stop < 1, but when i/Jstop is sufficiently large the compressed data chunk is diluted in a large 
number of repeated symbols, reducing the entropy of the signals and giving //stop > 1- However the 
potential gain in Cj^ is compensated by the need to add uncompressed symbols. If A^bits.code is the 
number of bits needed to store the information used to decode a symbol, Lpck is the length of the 
packet, then from the condition 

"samples^bits 

Tvph ^symb^bits,code — ^ck) 

and from Lin = "sampies^^bits, assuming the optimal case Lout = Lpck the dumping factor for the 
compression efficiency is derived 



'/store ~ 



J _l_ ^symb A^bits.code ^xh 



(3.11) 



'^samples Ambits 

In general, for a stationary signal A^symb ^ "samples so that T/store is a second order correction which 
will be neglected in the remaining of the text, but it becomes important in the case of non-stationary 
signals for which A/symb ~ "samples, which could occur in case of fast drifts. 

Two non-idealities very complicated to be analyzed are the difference between the expected 
entropy and the sampling entropy, and the compressor inertia. 

The theoretical estimates of entropy and hence of the expected compression rate, gives the 
expected entropy calculated on an ideally infinite number of realizations of samples. This means 
that even very infrequent symbols for the samples are considered by theory. But the compressor 
stores a few hundreds of samples for each packet leading to a truncated distribution of samples and 
consequently to a sampled entropy which in general is smaller than both the theoretical expectation 
and the entropy measured on a long data stream. In theory if G{Q) is the cumulative PDF for 
the distribution of samples, and if Q is bounded between Qinf and Qsup it would be sufficient to 
rescale the fg by 1 /(G(Qsup) - G{Qm)) and redefine accordingly the sum in the definition of the 
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Shannon entropy. As an example, in the case of a simple normal distribution cutting the distribution 
respectively at 1, 2, 3 and 4cr will reduce the entropy as predicted from Eq. (2.43) respectively by a 
factor ?7q = 0.79, 0.89, 0.95 and 0.98. However, the difference between theoretical entropy, or even 
the entropy measured on long data streams, and the sampling entropy measured on short packets 
could be changed by the presence of correlations in the signals on scales longer than the typical 
time scale of a packet. Last but not least, it is necessary to consider that the compressor takes some 
time to optimize its coding scheme, leading to a further loss in compression efRciency. 

The effect of all of these non idealities are too complicated to be introduced in the theoretical 
model, so that the tuning of REBA parameters based on the theoretical models has to be refined by 
numerical optimisation. Numerical optimisation it is important to handle difficult cases in which 
the hypothesis of the theoretical model fails, it allows experimentation with artificial perturbations 
introduced in the signal and it includes higher order effects such as the packet-by-packet variability 
of Cr, In addition numerical simulations must be used to verify the optimized parameters before 
uploading them to the instrument. 

With these aims the Onboard Computing Analysis (OCA) software was developed, composed 
of a scanner, able to run the same test on diff"erent combinations of REBA parameters; an analyzer, 
able to automatically extract relevant statistics on each test; an optimizer, able to apply different 
policies defining when a combination of parameters is optimal or not selecting the best combi- 
nations; a report generator, used to generate automated reports. Apart from REBA optimization 
the development of the OCA libraries has been driven by the need to have a flexible envirormient 
for testing ground segment operations as explained in [Fraillis et al. (2008b)]. Hence, OCA is able 
to read, decode, and process small amounts of raw data coming from the Planck/LFI scientific 
pipeline from packets to complete timelines. 

At the core of the part of the OCA software dedicated to the REBA optimization there is a C++ 
kernel, (0CA2K) which processes the input data for each combination of REBA parameters perform- 
ing: i. on-board mixing and quantization by using the real algorithm; ii. on-board compression by 
using the on-board algorithm; iii. on ground decompression and reconstruction. 

It has to be noted that 0CA2K uses the same C code for compression operated on-board. So it 
does not emulate the compressor but uses the real compressor. In addition the validation of proper 
emulation of the on-board and on-ground processing has been provided by using data generated 
in the framework of the validation of Level-1. of the Planck/LFI DPC [Fraillis et al. (2008b)]. In 
that way we demonstrated that 0CA2K processes the data in the same manner as the real processing 
chain. 

The input of OCA and 0CA2K are short data streams of raw data downloaded from the instru- 
ment just before on-board averaging or just after it (see Fig. 2) depending on whether A'aver has to 
be optimized or not. In output 0CA2K provides packet-by-packet measures of Cr and its related 
quantities such as the estimated packet entropy, or the measured compression efficiency 77oca2k- It 
provides also sample-by-sample estimates of critical parameters as eq, and Qack- 

Despite 0CA2K is written in C++, it remains a heavy, offline tool, which can not be directly used 
for a crude exhaustive real-time optimization. This is the reason for which analytical methods have 
been developed. On the contrary OCA has the ability to use the analytical models to focus on the 
relevant region of parameters space. 

OCA allows the determination of the optimum parameters according to different optimization 
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strategies and constraints. This is important given the different ways in which REBA parameters 
are optimized during ground tests and in flight. During ground testing the usual procedure has 
been to stabilize the instrument and its environment, calibrate the DAE and then to acquire chunks 
of about 15 minutes of averaged data to be analyzed by OCA to optimize the REBA parameters 
[Cuttaia at al.(2009)]. After setting the REBA parameters another session of 15 minutes of acqui- 
sition, this time with the nominal processing described in Fig. 2 is executed as a cross-check. In 
flight the procedure will be to acquire continuously data by using the nominal processing Short 
chunks of unprocessed data will be acquired daily in turn from each detector. The comparison 
of unprocessed with processed data will allow to monitor of the processing error. In addition the 
REBA tuning might be repeated daily on the chunck of unprocessed data in order to test whether 
some REBA parameters on-board the satellite should be changed or not. 

OCA could be used as a stand-alone application, but different interfaces for OCA to other pack- 
ages have been created for different applications. For ground segment testing OCA provided an 
IDL and C++ library used in a stand-alone program. The same occurred for the Planck/LFI simu- 
lation pipeline where parts of the 0CA2K simulating on-board preprocessing and ground process- 
ing, (excluding compression and decompression) have been included in the Planck/LFI simulation 
pipeline. For the REBA optimization during the ground tests, OCA has been used within the LIFE 
framework [Tomasi at al.(2009)]. For routine operations in flight OCA has been included in the 
PEGASO [Tomasi at al.(2009)] software tool designed to monitor the instrument health and perfor- 
mances at the Planck/LFI DPC. 

3.5 The 0CA2 optimization algoritlim 

As a premise to REBA processing optimization, a value for A^aver. a cj^^ and a function of merit 
X appropriate to the case under analysis have to be fixed. As explained, in general A^aver is already 
fixed by other considerations than REBA processing optimization. A slightly higher than needed 
cj^' is taken in order to allow some margin. While the Fdiff is considered a sufficient function of 
merit, but more complex functions, such as those in the family of functions presented in Eq. (3.2) 
are used as well. 

To allow optimization, a data chunk long enough to allow the generation of about a hundred 
compressed packets is acquired for each radiometer. In general, the data chunk is on-board pro- 
cessed by allowing coadding for the the given A^aver- For that chunk relevant statistics such as Ts^y, 
T'load, CTsky, cTioad, cTskyjoad, ^T^ky, Arioad are measured and from them r and are evaluated. 

The analytical optimization is performed in order to i) determine in an approximate way the 
region of ri , r2 where the function of merit could have a peak; ii) to grid the region ri , r2 (typ- 
ically by regular sampling); iii) to determine for each point in the region the function of merit 
X(T\,r2) and (r°^'™,r2^'™) as well as the (ri,r2) for which ;^'(ri,r2) has its absolute maximum; iv) 
and finally for the previously determined {r°^^^™ ,r^^^™) to determine the (9°p'™ = C)(rj'"™,r2'"™) 
and the ^^jjj^ = qo^tir'^^™ After that max(|Q|^j,(r°^'™,r°^'™)|) i.e. the maximum value 
of |Qack,/(OI among the Qack values determined on the data chunck for ^ = 1 (see Eq. (2.61)) is 
measured. From max(|Q^^]^|) is corrected for saturation if needed. In fact, if max(|<3^^j^|) < 
( 1 - Sfx)q*^ . the analytical optimization returns . as the best estimate of q otherwise it forces 
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^optim ~ "^^^(l^ackl-*/*^^ -<Sft)- In the latter case q^^^^^^ is said to be saturation-limited and of course 



in that case it is expected to have Cr > cj^'. 



After the analytical optimization the jj^^j has to be numerically refined in order to take into 
account the non-idealities of the compressor. If 4^^^^^ is not saturation-limited the 0CA2K is oper- 
ated to determine, by a polynomial search, the best ^opt allowing Cr = for given r°^'™, r"'"™, 
and (9°P"™. in general the search is performed for q in the range max(|Q^^j^(rj'"™,rj''"™)|)/(l -»Sft) 
and 2(3*,. . If (J*,. is saturation-limited the numerical procedure could be in principle skipped. 

^ optim ^ optim ^ f f ff 



However non-idealities could cause < cj^' even in this case and to check for this a single run 

Tot 

of the numerical optimization is performed for the selected parameters. If d > Cr ° the procedure 
is concluded, otherwise the polynomial search is applied. 

When ^opt have been numerically refined the numerical code for r"*"™, r'^^™, (9°p'™ and 
is runned once again to asses the processing error and the histogram of the compression rates. 

The typical time to perform the optimization sampling (r\ , r2) with a grid of 25 x 25 samples 
and a TOI of about 15 minutes of data, is about 20 sec, so that the optimization of the whole set of 
44 detectors takes less than 15 minutes including the overheads for data lO. 



4. Results 



As an exemplificatio, the results of REBA calibration and optimization in the framework of the 
Planck/LFI ground tests are presented here. First a single test is analyzed to compare the analytical 
and the numerical optimizations. After that the results of the calibration for the whole set of 44 
detectors are presented for a real case. 

The results of analytical v.z. numerical optimisation are compared by using real Planck/LFI 
data acquired during the RAA tests of the instrument performed at Thales Alenia Space (Italy), 
during the summer of 2006. Fig. 10a shows 12 min of data with A^aver = 52, equivalent to about 
567 15 samples, while Tab. 2a gives the relevant statistics for the TOI. During the test the instrument 
and its environment were stable, no strong drifts are present in the data. A clear correlation between 
sky and reference-load is evident in the plot explaining the Qsi ~ 1 and the factor of six reduction 
of the RMS when passing from undifferentiated to differentiated data. Also the separation between 
sky and reference-load is not large being just 18cr. So after mixing the distributions for fi and 
^2 will stay well separated, with |Adisti l > 100, when |r2 - ri| > 0.2. In this case it is reasonable to 
expect that both the low-accuracy and high-accuracy methods to estimate analytically the entropy 
will give comparable results. 

Indeed, both models to optimize the REBA parameters for cj^^ = 2.4, give exactly the same 
results as shown in the second column of Tab. 2b and Tab. 2c. 

To test the goodness of the AO, 0CA2K was run imposing q = I and taking the same values of 
rl, r2 used for the AO. The predicted entropy of the processed TOI is compared in Fig. 10b. There 
the relative difference between the entropy measured all over the TOI and the entropy computed 
analytically by using both methods is reported. Patches define intervals of accuracy in steps of 
0.5% up to 3%. Both methods to estimate the entropy are good predictors of the measured entropy, 
apart from the region marked with the white boxes where the low accuracy method overestimated 
the entropy. 
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Figure 10. Example of signal for the detector 2300 acquired during the tests and its optimization. Frame a: 
are the sky and reference-load and AT samples in analog-to-digital units (ADU). The differentiated signal 
has been shifted up, having null mean. Relevant statistics are reported in Tab. 2a. Frame b: represents the 
accuracy by which the analytical model is able to predict the entropy measured on the quantized signal as a 
function of r\ and for ^ = 1. Contours of regions are for accuracies of 0.5%, 1%, 1.5%, 2%, 2.5%. White 
boxes denote the regions where an accuracy worst than 3% is obtained by the simplest analytical model. 
Outside these regions the two models are completelly equivalent. Frame c: represents the accuracy by which 
the most sophisticated analytical model presented in Appendix A is able to predict the Cr for g = 1 . Contour 
lines are for accuracies of 5%, 10%, 15%, 20%, 25%, and 30%. Frame d: compares Fjiff for the analytical 
model with Fdiff computed for the numerical model. Colours are for Fdiff analytical equal to 0.1, 0.2, 0.3, 
0.4, 0.7, and 0.9 while contours are for the same values of Fdiff numerical. The black hexagon identifies 
the regions for the peaks of Fjiff analytical, the crosses for the peaks of Fjiff numerical. Frame e: the same 
as of Frame d, but for Fsky. Frame f: in black on the left, the histogram of Ci- obtained on the 111 packets 
produced by the compression procedure with the optimized parameters of Tab. 2b. The figure gives also the 
5% and 95% percentiles (p05 and p95) and the mean, and in red a gaussian fit of the histogram. In addition 
the figure shows in red at the right the Ci expected from the entropy directly measured on each packet, and 
the theoretical compression rate expected from the theroretical model (CrTh) and from the entropy measured 
on the whole quantized TOI. 



In a similar manner in Fig. 10c the measured Cr and the predicted from the high accuracy 
model are compared. Again the model is able to reproduce within 20% or better the measured Q . 
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As discussed before the differences can be ascribed mainly to the difference between the sampling 
entropy and the expected entropy ([Maris et al. (2000)]) and the not-ideal behavior of the compres- 
sor ([Maris et al. (2000)]). In general the effect of the sampling entropy would result in a higher 
Cr than expected while non idealities in a lower Different ways can be used to calibrate these 
effects, however their interplay with the statistic of the signal is complicated and it is preferable to 
use 0CA2K to fine tune the REBA parameters optimized by analytical mean, given that in general 
the corrections required to properly tune with respect to the analytical prediction q are at most of 
about a factor of two. 

The numerically refined optimal parameters are reported in the third column of Tab. 2c, as it 
is evident the only variation is just for q, the reason is explained by the contour plots in Fig. lOd 
and Fig. lOe which compares the predicted analytical Fdiff and r^ky with the Fdiff and T^^y obtained 
for the numerically refined parameters. Very good agreement is obtained in the location of the 
peaks which determines the optimal ri , r2 and in turn the O. For completeness the last column of 
Tab. 2d reports the theoretically estimated Qack, fg.diff, fg.sky and e^joad after replacing the analytical 
optimal q with the numerical one. The very good agreement between the theory and the experiment 
is evident. 

As expected the quantization error for the differentiated data is smaller by about a factor of four 
than the error for sky or reference-load and anyway the error will be a fraction of ADU, but larger 
or comparable to the quantization error introduced by the ADC converter, which for A^aver = 52 is 
equivalent to ^ 0.04 ADU. 

Eq. (2.24) expresses the processing error for an univariate normal distribution as a function of 
the cr/q ratio, but from Sect. 2.5 it is evident that in the present case the cr/q ratio is not a good 
measure of the processing error. At the opposite, it is possible to define an effective cr/q ratio in 
terms of the processing error as 

h =-^; (4.1) 
\qUff VT2e<;,difF 

which expresses the number of independent quantization levels which could be accommodated 
within lo". So this ratio gives an idea of how well the histogram of the differentiated data is sampled 
assuming it could be represented by an univariate normal distribution ^. A proper sampling would 
assure at least (cr/^)eff > 2 which is the case for this work as it is evident from Tab. 2c having 
(o-/q)eff > 6. 

Before concluding this comparison it is worth commenting the way in which the experimental 
Cr is reported in Tab. 2c. This is best done by looking at Fig. lOf where the histogram for the Cr 
of the 111 packets produced in the test is shown. Given the true Cr is a random variable, varying 
from packet to packet we take the 5% and 95% percentiles assessing that in less than 5% the Cr 
will be respectively smaller or larger than the quoted Cr, as well as the mean and the median (not 
quoted in the figure) of the measured Cr. In this case it is evident that the target Cri2.4 is achieved 
in something less than half of the packets. So it would be better to introduce some safety factor, as 
an example by requiring cj^^ ~ 2.6 or by requiring the median be Cr to be 2.4, or better by asking 
the 5% percentile of the Cr distribution to be ^ 2.4. 

^The (o"/g)eff could be used to characterize the processing in the case the value of e^,sky and e^_ioad is not relevant. 
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Fig. 1 1 is representative of the results of the calibration for a whole set of 44 detectors obtained 
during the Planck/LFI CSL test campaign in 2008 [Cuttaia at al.(2009)]. Data have been collected 
over two acquisitions, the first one being used for the calibration itself and the second one to verify 
the calibration performances. The environmental set-up and the onboard electronics were kept in 
a stable state during both acquisitions. During the first acquisition, called "calibration run", the 
on-board computer was configured to apply just the downsampling step to the data but skipping 
mixing, requantization and compression. The acquired data have been ingested into 0CA2K to 
generate a list of optimized processing parameters for a target Cr = 2.4. Having obtained a set of 
parameters for the REBA, the second acquisition, the "verification run", while the instrument was 
set up to acquire data in nominal conditions, by using the same processing steps that are going 
to be used during flight. At the same time data have been also acquired in the raw format used 
in the "calibration run". So for each detector couples of data streams with and without on-board 
processing were obtained which have been compared in order to measure the processing error, 
following a procedure similar to the one described in [Fraillis et al. (2008b)]. 

Fig. 1 1 compares the mean d, e^^sky/cTsky, e^joad/cioad, e"?,diff/o"diff, where (Xdiff is the r.m.s. for 
the differentiated data. Bars in light colours are for results obtained by processing the data taken 
in the calibration run with the 0CA2 simulator. Bars in dark colours are the results from the data 
processed by the instrument in the verification run. In both cases the same set of optimal REBA 
parameters have been used. There is good agreement between the two runs, despite the presence 
of a few systematics. Such differences are due to slight changes in the environmental conditions 
between the two runs Differences in channels belonging to the same radiometer (00 and 01, 10 
and 1 1) are due to the fact that in the warm back end the two channels go through separate acquisi- 
tion lines, each of them being characterized by different noise properties [Bersanelli et al. (2009)]. 
Such differences are usually small, e.g. detectors (00, 01) and (10, 11) of Feed-Horn #19. In a 
few cases however larger differences occurs, e.g. detectors (10, 11) of Feed-Horn #25. Again, the 
relative processing errors for sky and reference-load are very similar, and in 95% of the detectors 
they are below 0.4 with some extreme deviations such as detectors 10 of Feed Horn #25 and 1 1 of 
Feed Horn #26 for which eg sky/cTsj-y ^ e^,ioad/o"ioadil- Here the optimal ^opt is not peculiar with 
respect to the values required for the other detectors, but optimal ri and r2 are very similar hav- 
ing \r2 -r\\= 0.04 which is the resolution of the search grid in the (ri,r2) space. Such relativelly 
"coarse" resolution in the search grid for optimal {r\,r2) was imposed by constraining the need to 
optimize the REBA parameters within a few minutes after the data acquisition. The coarseness of 
the (ri,r2) grid is also the reason for the apparent coincidence of the mean values of r\ and r2 for 
different frequency channels in Tab. 3. In flight such time constraints will be removed allowing a 
multi step iteration of the optimization procedure and the use of a thinner grid. However, even in 
those extreme cases, the coorrelation between sky and reference-load processing errors leads to 
a much smaller error for the differentiated data. In all cases the final processing error is always 
less than 3.8% of the instrumental white noise. Scaling those numbers to the calibrated sensitivity 
per sample and per detector, the calibrated processing error, ATq, was derived, and is reported in 
units of juK per sample and per detector in line 10 of Tab. 3. On average hJ^ is below the 3 juK 



'^In the CSL tests the satelhte has been kept within a large cryogenic vacuum chamber which was not as stable as the 
L2 environment is. 
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level taken as a threshold for systematics [Mandolesi et al. (2009)] apart from the detector 01 of 
the Feed-Horn #24 for which Mc, ^3.2 fiK. 

The values for the optimal REBA parameters are mainly determined by the frequency of the 
radiometric channel with some dispersion from detector to detector. Tab. 3 gives representative 
median values for ri, ^2, ^ = l/Sq from the CSL tests as well as for the quantities in Fig. 1 1 and the 
resulting data rate. O is omitted since it is the most variable parameter and it has no significative 
impact on and Tab. 3 reports also the number of detectors for each frequency channel, the 
A^aver values which are kept constant, the compressed data rate per detector, per frequency channel 
and for the instrument as a whole. Quantities are reported in the form x + 5x where 5x represents 
the standard deviation taken as a measure of the internal dispersion of x within the given subset of 
detectors. Of course this number must not be interpreted as an error and it must not be propagated. 

The total data-rate in Tab. 3 is just 7% higher than the target data-rate 35.5 bits/sec. Again 
this departure is mainly due to the limited resolution in the search grid as well as small changes in 
the environmental conditions between the two runs. In order to cope with this problem it is likely 
that during operations a safer cj^' = 2.5 target will be set in place of the nominal 2.4. 

Finally it is worth to consider the gain in the accuracy of the REBA optimization obtained by 
the complex procedure described in Sect. 3 with respect to the fairly simple scheme used in the 
earlier RAA test campaign [Bersanelli et al. (2009)]. During the RAA tests a simplified algorithm 
had been applied based on the fact that putting r\=r the processing error for the differentiated data 
reduces to Eq. (2.60) which is independent of ri and r2- Hence, the only free parameters where q 
and r2 r. The optimization was performed by imposing Q = 2.4 and selecting those parameters 
for which e^y^sky ~ e^joad ~ eg.diff- Even in this case the required Cr = 2.4 was achieved but eg.diff/crdiff 
was between [0.08,0.14], when compared to the current eg^nff / cTdig < 0.038 it is evident how this 
procedure represents a substantial improvement. In particular LFI has as a target of keeping all of 
the instrumental systematics and non gaussian noises in the differentiated data below 10% of the 
instrumental white noise [Bersanelli et al. (2009)]. The optimization scheme described here allows 
the reduction of the processing error on the differentiated data by a factor of four pushing it below 
this ambitious target. 

5. The impact of the on-board processing noise on the Planck scientific performances 

A detailed analysis of the effect of on-board plus on-ground processing on the final scientific prod- 
ucts of Planck is beyond the scope of this paper The subject is simply too complex to be analyzed 
here, and the analysis has to be specialized to take into account each specific kind of product ob- 
tained from the Planck data i.e.: calibrated time lines, frequency maps, component maps, angular 
power spectra for those maps and cosmological parameters. Neglecting in this discussion the role of 
ri and r2 it is enough to say that for most of these products the effect of processing could be reduced 
to the effect of processing on the signal spectral decomposition in the time or in the spatial domain. 
Without to enter in to too many details, some consideration could be derived from the analysis 
reported in [Maris et al. (2004)] and from a forthcoming work in progress [Maris et al. (2009)]. 

In the extent in which the processing error for differentiated data is small when compared to 
the fluctuations in the signal, the noise model for the quantization error data could be applied to 
derive the level of degradation in the noise properties of the instrument. In the Planck/LFI case. 
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for differentiated data a typical (eg diff/Cwn) < 0.1 by using Eq. (7) of [Maris et al. (2004)] the final 



instrumental performances will be degraded approximately by a factor y 1 +(eg,diff/o"wn)^ < 101. 
This immedialty applies to the amplitudes of the spectral decomposition of the signal. Since pro- 
cessing acts as stationary white noise it will increase by less than 1% the power excess introduced 
by the white noise. On the other hand the effect on the phases will be similar to to random scrabling 
over the [0, 2n] interval, but even in this case the effect of quantization will be to increase by few 
percent the effect of the instrumental white noise. It is immediately possible to extend these results 
to the Planck/LFI calibration which will be based on measurents of the amplitude of the cosmolog- 
ical dipole, to understand how, when compared to the white noise effect, the calibration accuracy 
will be worsened by less than 1% by the processing noise. 

It is more complex to consider the case of the Planck/LFI sensitivity to primordial non- 
gaussianities. In [Maris et al. (2004)] the level of perturbation the measurements of primordial 
non-gaussianities introduced by the quantization were compared to the level of perturbation intro- 
duced by the residuals astrophysical foregrounds concluding that they are very small. However, it 
is worth analyzing the effect of processing on non-gaussianities even assuming an ideally perfect 
separation of the foregrounds. To the extent that the quantizer has null expectation, null skewness 
and in general null central moments of odd order, no effect is expected on tests of primordial non- 
gaussianity which are sensitive just to central moments of odd orders. Of course a symmetrical 
quantizer will alter the central moments of even order, such as the kurtosis. To estimate this effect 
it is sufficient to compare the distribution of the processing noise from a uniform quantizer with the 
case of normal white noise while taking the average over repeated measures. In the white noise 
case the central moments of even order are yu„ = Cg{n)cr" /N"^^, with Cg{n) = n!/(2"''^ *(n/2)!) * s^n). 
In the case of the quantizer il„{N,q) = C{N,n){ql yf\2)" jN'^l'^, whatever the form of C{N,n) is the 
Central Limit Theorem assures that for any n 



but given is finite, a bias in the estimator for each moment of order n will appear. Expanding 
C(N,n) about 1 /N up to the leading term 



where A„ is a serie of coefficients having |A„| < 1 whose first three elements are A2 = 0, A4 = -2/5, 
A(, = -6/5, Ag = -12/7. Since for Planck/LFI N ^ 60 the bias in the expectation of higher order 
moments will be very small. In addition, since is a characteristic parameter of a mission, any 
bias due to processing in the estimators of central moments could be predicted and removed. 

In closing these considerations it is worth noting that for Planck/LFI the most important lim- 
itation to apply an hard requantization of data comes from the need to hmit the effect on the total 
power data rather than on the differential data. Firstly it has to be noted that the cancellation effect 
of quantizzation errors on sky and reference-load from which Eq. (2.58) is derived, applies only to 
the {P\,P2) space of mixed data to the extent in which g/cr, < yfl2/3 for any i = 1, 2. Another limit 
in the maximum amount of processing noise which could be introduced in total power arises when 
it is taken in consideration the fact that measurements in total power on the reference-load signal 
have the potential to be a valuable tool as a source of diagnostics of instrumental systematics. An 




lim C{N,n) = Cg{n). 
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example is given by the study of thermal effects induced by the instability of the reference-load 
which could be detected by cross-correlating Tioad with measurements of temperatures acquired 
by thermometers located in the Planck focal plane, which limits the quantization on Tioad to be 
eg.ioad/o-ioad < 1/2 [Maris et al. (2009)]. 

6. Final Remarks and Conclusions 

As for many past, present and future scientific space missions, the ESA Planck mission has a 
limited bandwidth to download the scientific data produced by its two instruments. The allocatable 
bandwidth for the Planck/LFI is about 2.4 times lower than the raw data flow produced by the 44 
detectors comprising the instrument, which is made of an array of 22 pseudo-correlation receivers, 
each one comparing the signal received from the sky with a reference signal. To fit the allocated 
bandwidth data have to be preprocessed on-board and loss-less compressed prior to transfer to the 
ground, where each step of the on-board preprocessing has to be reversed to recover the original 
information. Since not all of these steps are completely reversible, a overall reduction of the quality 
of the data occurs, which has to be quantified and reduced as more as possible, not to degrade the 
instrument performace. 

This paper has presented a detailed discussion of the on-board plus on-ground processing for 
Planck/LFI and of its free parameters which can be adjusted in order to fit the compressed data- 
rate to the allowed bandwidth. In addition, this paper has presented a model to quantify the level 
of distortion in the scientific data as a function of the free parameters for the on-board processing 
and as a function of the attainable compression rate. At last the paper has reported on the way 
these parameters are optimized to cope with the required bandwidth while limiting the processing 
distortion. 

Three new results about the way in which the output of a pseudo-correlation receiver could be 
handled are presented. First, a new algorithm: mixing followed by requantization and interlacing 
to prepare the data stream for compression, limiting at the same time the amount of processing 
distortion is introduced. This method is effective, since most of the time variations in the sky and 
reference-load data streams are correlated. Mixing reduces the effective variance of the signal to be 
compressed therefore relaxing the need for requantizing the data. At the same time the processing 
errors in the mixed data are correlated, so that on ground demixing introduces cancellation effects 
which reduce the re-quantization error in the differentiated signal. Second, a model which quanti- 
fies the level of distortion in the scientific data as a function of the free parameters of the on-board 
processing and as a function of the attainable compression rate is given. Third, it presents a general 
procedure to search for optimized processing parameters which has to cope with the proper use 
of the allocated bandwidth, requiring a nominal compression factor of 2.4, which has to limit as 
much as possible the processing distortion, and has to be fast given that both in the pre-launch tests 
campaign and in flight only a short time could be allocated for optimization. 

The optimization procedure is based on a combination of the analytical model and of a soft- 
ware simulator, all integrated into a single applicative program called 0CA2 which takes as input a 
Planck/LFI data stream applying to it the whole on-board and on-ground processing to measure 
the processing distortion and the data-rate. The two approaches complement each other: the ana- 
lytical model is very fast and could be used to rapidly select regions of interesting combinations of 
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parameters; the simulator is able to handle conditions which hardly can be analytically modelled 
and is used to refine the parameters identified by the analytical model. The input data for 0CA2 
can be provided indiflFerently by the Planck/LFI flight simulator [Reineker, Dolag (2006)], or from 
specific acquisition sessions performed during the test campaign or in flight. A wide selection of 
combinations of processing parameters and optimization criteria can be explored by this code. In 
practice this work shows that the analytical model is refined enough to allow a full optimization 
of all of the processing parameters apart from the on-board re-quantization step, q, which has to 
be tuned numerically to account for a number of non-idealities in the data and in the compressor, 
part of which have been discussed in the paper. However, it is interesting to see that in no cases 
there are no situations where the diff"erence between the analytical and the numerical model in the 
optimized q is larger than a factor two. 

The last part of this work reports the performances of the optimized on-board algorithms in 
the framework of the pre-launch tests required for instrument qualification. In that case it has been 
demonstrated that the 2.4 compression factor required to operate Planck/LFI could be attained 
introducing a modest quantization noise equivalent to a 3.8% of the unprocessed signal rms, which 
is equivalent to an increment in the instrumental noise of less than 1%, in particular processing 
is not harmful to the scientific exploitation of Planck/LFI data such an example in the study of 
primordial non-gaussianities. 

In conclusion it is worth noting that the apphcation of mixing follwoed by requantization is of 
general use and could be extended outside the case of the Planck/LFI since it could be used in any 
situation in which data alternatively taken from a signal source and a reference source are sent to a 
remote station. 

Acknowledgments 

Planck is a project of the European Space Agency with instruments funded by ESA member states, 
and with special contributions from Denmark and NASA (USA). The Planck-LFI project is devel- 
oped by an International Consortium lead by Italy and involving Canada, Finland, Germany, Nor- 
way, Spain, Switzerland, UK, USA. The Italian contribution to Planck is supported by the Italian 
Space Agency (ASI). We thank the support of the Spanish Ministry of Science and Education. 

A. Approximation of tlie bivariate entropy 

This section presents the approximation of the entropy for an interlaced bivariate distribution for 
the two limiting cases of a uniform distribution or a normal distribution. 

The case of two uniform distributions 

The case of two uniform distributions is analyzed by defining the intervals where the distributions 
are not null as Qi i < Q\ < Qi,r and 02,1 < Qi < Q2,i, and their widths A'l = Qi r - Qi,i, N2 = 
22,r- 22,1, their centers Qi = (Qi j + Gi,r)/2, Q2 = (62,1 + G2,r)/2. Without any loss of generality 
Ni < N2 is assumed. The entropy is a linear function of |A| = |Q2 - 2i| bounded between the lower 
limit 
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™" 2\ N2j 2 \Ni N2) ^\Ni NoJ 
in the case the two distributions are completely overlapped, and the upper limit 

//oo=log2A^i+log2A^2 + l, (A.2) 
for the case of complete separation of the two distributions. So that 

[ IM < /^min 

H = \'-2f^<\A\<'-^,{H^-H^y^^^+H^, . (A3) 
[ ^ < |A|, 

The case of two normal distributions 

Whithout any loss of generality, it is possible to reduce to the case of two sources of normal- 
distributed interlaced signals having respectively variances equal to 1 and cr^ > 1, and quantized 
with a quantization step q <l. 

An approximation for /f(Adistr,^,o") can not be derived analytically, but in a manner similar 
to the case of the uniform distribution the entropy is bounded between a lower limit, /fmin, and an 
upper limit Hco, while varying Adistr- In Fig- 12a shows how the entropy varies as function of Ajistr 
for three different values of cr. In addition both H, Hca and /fmin are proportional to - log2 q so that 
their differences does not depend on q. For this reason it is convenient to define the Normalized 
Entropy, h 

h = ; (A.4) 

^min J^co 

which is just a function of Adistr and cr as shown in Fig. 12b (as full lines) for three values of cr. 
Having h, H - Hoo and q 

H = Hco-log2q + {Hco-Hnim)h, (A.5) 

note that there is not need to estimate ffmin, and that Hoo could be readily estimated from Eq. (A.6) 
by putting o") = 1, cr = 0-2/0"!, and expressing q in units of o"i. 

The difference (Hca - H^nm) ^ 1 bit is always positive and just function of cr. It is null in 
the limit cr — > +00 as shown in Fig. 12c. The figure displays with a dotted line an approximation 
obtained numerically for 1 < o"2 < 4000 for which 



Hco Hmin — exp 



^A„(logo-2)"+Ao 
«=i 



(A.6) 



with Ao = 1.0893 x lO'^, Ai = -8.3819 x lO'^, A2 = -2.3699 x lO'^, A3 = 4.8141 x lO'^, A4 = 



5.1620 X 10-^ A5 = 2.1425 x lO""^ within an accuracy of ±1.1% 



'0. 



For h a numerical approximation at the level of a ±1% accuracy is 



distr 

. (A.7) 
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Here o"* is just a function of cr, and it is bounded between 0.25558 < cr* < 0.30797 with an ^ 
21% variation, as shown in Fig. 12d. An upper limit for h is derived by putting cr» = 0.30797 
overestimating the entropy of at most (^/I^Z^^I^^)0.16 < 0.16. However cr^ is a function of 
{Hoo - Hynin) allowing to derive 

cr, = V6-8497 X 10-2 + 2.5965 x 10-2(/f^ - + e (A.8) 

with |e| < 2 X 10"^. With this approximation the typical accuracy in estimating H is better than 
0.01 0.03 bits and the optimal q for a given /ftgt can be derived from 

log2 ^opt = (H^in - Hco)h +H^- /ftgt, (A.9) 

within a relative numerical accuracy of about 3%. 

In short, the algorithm of optimization becomes for given ri, r2, q, cri, 0-2, T\, T2 is: i.) 
compute cr, Adistr, ii.) compute cr,, h, Hoo, Hoa - H^m, iii-) compute ^opt- 

B. ADC quantization 

Throughout this work it is assumed that the ADC quantization is not relevant for the scopes of this 
paper. However, it is worth to briefly recall its impact, in particular looking at the conditions at 
which the ADC noise could be neglected. 

The resolution, or quantization step of the ADC, ^adc is given by (Vmax - ^min)/2^^ Volts/ADU, 
after averaging by A^aver samples ^adc is reduced by a factor 1 / VMiver- The efl'ect of the ADC 
resolution is to add in quadrature a non-Gaussian noise to the signals of RMS 1 / VT2 before 
averaging and 1 / VT2/Vaver after averaging. In addition the ADC itself adds a random read-out 
noise of cr^Dc ADU which after averaging is reduced to cr adc/ V-^avei- When combined these 
two noises the readout noise whose RMS is o"ron = Vl/12 + crADC before averaging and iXron = 
Vl/12 + crADc/ V-'Vaver after averaging. 

When a signal of RMS ctq is input to the DAE a gain, G, is applied and then the measured 
RMS is 

c^=V^L + GV2, (B.l) 

depending on the ratio cr^on/Gcro. The measured RMS will be dominated by the ADC noise or 
by the signal RMS. Signals whose RMS is comparable to the read-out noise are defined as weak 
signals. Of course in the case of weak signals the read-out noise is no more negligible when, 
as an example, the ctq has to be measured in order to estimate the Tsys- The same is true when 
the variation of the RMS of the signal tacking in account of variations of G has to be estimated 
[Cuttaia at al.(2009)]. In addition, given the 1/12 factor in front of the variance induced by the 
ADC contribution, the read-out noise could be dominated by the ADC noise when cr adc > 0.3. As 
a practical example if ctadc ~ 0.5 and o"o ~ 1 then cTron ~ 0.57 and the bias in estimating ctq will be 
^ 15%. 
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C. DAE Tuning 



In an ideal scheme of operations, the various stages of a complex instrument such as Planck/LFI 
would have to be calibrated sequentially, so that the calibration of the REBA parameters would 
be the last step of the calibration procedure [Cuttaia at al.(2009)] and would have no effect on 
the previous stages of calibration. Practical experience has shown that there is a case in which 
the tuning of the acquisition electronics has consequences on the subsequent tuning of the REBA 
parameters. Indeed, the hypothesis at the root of the whole compression scheme is that the noise 
variance of the input signal is large. This is in general true but this hypothesis could fail if the 
variance of the signal after ADC quantization, on-board coadding and mixing becomes too small. 
In that case the signal will be over compressed with > Q- ^' and the squared quantization error 
will be larger or equal to the signal variance. To avoid this case either the DAE gain G and A^aver 
have to be properly tuned, or a set of particular combinations of rj, r2 values has to be excluded. 

The problem is to ensure cri and 0-2 to be greater than a minimal cr,gt typically assumed to be at 
least 2 adu in a suitable range of ri, r2 values. From Eq. (2.35) it is evident that the crj as a function 
of r, defines two identical concave parabolas with a minimum in ri = r2 = rmin = crsky,ioad/o"foa(j. 
where both cri and 0-2 takes the value 



= Crsky^/r^, (C.l) 



where ^psi is the correlation coefficient between sky and reference-load. Note that cTmin = just as 
in the case of a perfect correlation between sky and reference-load. So a sufficient condition to 
asses proper DAE calibration is 

CTtgt < cr s]jy 



which puts a constraint on the minimum G/ VMiver which could be accepted. In particular assuming 
the quantization and the readout noise are small with respect to the sky and reference-load RMS, 
at first order 

G o-tgt 

^ (C.3) 



where os^^y ^ is the sky RMS with G = 1 and no averaging is applied. 

It could happen that in some cases the condition (C.3) can not be full-filled for any reasonable 
value of G and A^avei - So a forbidden region in the ri, r2 space is defined by the need to have 
CTtgt < min(cri,cr2). This defines a "cross" centered into n =r2 = r^m = o"sky,ioad/o"foad' ^^S- 
with "harms" parallel to the two axis of the ri , r2 space and having for each harm a width 

Ar = 2^Jl^t-il-gl). (C.4) 

During DAE calibration, the operators could monitor the evolution of Ar as G/ ^/N^ varies. In 
general the optimization of ri and r2 is performed by scanning a rectangular region in the (ri,r2) 
space of limited width, thus an informative parameter to avoid to prevent a proper REBA calibration 
after DAE calibration is to check the fraction of area of the region of interest excluded by the DAE 
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calibration /dae.bxci- It is not possible of course to write a general formula for all the possible cases, 
but if is small, the excluded region has a center near n = = 0, while the optimization region 
is a square, centered on the origin with -r^^ <ri,r2< +nim in that case 

(4riim-Ar)Ar 

/DAE,excl - 7^ ■ (^-J) 

Ari. 

lim 
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Table 2. Table for the optimization example illustrated in Fig. 10. Subtable A) reports sky and reference- 
load statistics together with their correlation, and the derived r and Ra-. Subtable B) reports the optimized 
REBA parameters, obtained respectivelly with the analytical approximation, and subsequently with a nu- 
merical scan of the parameter space. The processing statistics are reported in the Subtable C) where the 
processing errors and the Cr expected from the analytical model are compared with the ntmierical results. 
The third column of the subtable gives the theoretical expectations after replacing q obtained from theory 
with q obtained by numerical means. 



A) TOI Statistics 





sky 


reference-load 


Combined 


Mean [adu] 


12041.29 


12313.63 




RMS [adu] 


9.72 


10.06 




Slope [adu/sec] 


0.026 


0.027 










0.9988 


r 






0.9779 








0.9659 


RMS(Ar) [adu] 






1.45 




B) Optimized Parameters 






Analytical 


Numerical 




rl 


1.25 


1.25 




rl 
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0.83 
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0.317 






C) Processing Statistics 
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Analytical 


Numerical 


numerical q 
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Adistr 
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CTi [adu] 


3.291 


3.292 
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0-2 [adu] 


1.885 


1.884 


- 


e^.difF [adu] 
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0.067 


0.068 


eq.sky [adu] 
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eq,\oad [adu] 
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Figure 11. Results for a typical session of REBA parameters tuning during the CSL test campaign. From 
top to bottom the figure reports for each detector the mean Cr, Qy.sky/csky, e^.ioad/cioad, e£/,diff/o"difF where 
cTdiff is the rm.s. of the differentiated data. The red line in background of the top frame denotes the target 
Q*^' = 2.4. Values are represented by bars. Light-bars are the results from the calibration phase, where 
raw data from the instrument are processed by 0CA2. Dark-bars are results from the verification phase, 
where processing is performed on-board. The secctgWrame from Top gives an example for detector 00 
of Feed-Horn #19. Feed-horns are numbered according to the internal Planck/LFI convetion assigning at 
Planck/LFI the Feed-Horns numbers from #18 to #28. Detectors belonging to the same Feed-Horn are 
grouped together as shown in the third frame from top. 



Table 3. Representative REBA Parameters, the measured and relative processing errors from tlie CSL 
tests for Planck/LFI. Detectors are grouped by frequency channel, for each quantity x the table reports its 
group median and group standard deviation 6x as a measure of the group internal dispersion, 6x must be not 
considered as an error. 





30 GHz 


Frequency Channel 
44 GHz 


70 GHz 


Detectors 


8 


12 


24 


^aver 


126 


88 


53 


n 


1.042 + 0.032 


1.042 ±0.024 


1.042 ±0.012 




0.917 ±0.065 


0.917 ±0.025 


0.958 ±0.020 


q [adu] 


0.297 ±0.034 


0.198 ±0.044 


0.279 ±0.048 




2.400 ±0.024 


2.440 ±0.019 


2.380 ±0.023 




0.420 ±0.278 


0.269 ±0.184 


0.177 ±0.063 




0.432 ±0.267 


0.271 ±0.183 


0.178 ±0.063 




0.0341 ±0.0016 


0.0369 ±0.0010 


0.0351 ±0.0010 


AT, [mK] 


1.759 ±0.148 


2.412 ±0.356 


1.905 ±0.287 


Data Rate per Detector [bits/sec] 


454.9 ±4.1 


640.8 ±4.7 


1108.2 ±9.8 


Data Rate per Frequency Channel [bits/sec] 


3641.8 


7689.9 


26600.3 


Total Data Rate [bits/sec] 




37932 
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Figure 12. Frame a: Entropy variation for a couple of normally distributed signals as a function of Adisa 
for three values of cr. The reference entropy is //oo. Frame b: Normalized entropy for a couple of normally 
distributed signals as a function of Ajistr for three values of cr. Full lines: numerical integration. Frame c: 
Normalized entropy difference Hoo - //min as a function of cr for a couple of normally distributed signals. 
Full lines: numerical integration. Dots: approximated formula. Frame d: The cr, parameter as a function 
of cr for a couple of normally distributed signals. Full lines: numerical integration. Dots: approximated 
formula. 
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Abstract: To asses stability against 1 // noise, the Low Frequency Instrument (LFI) on-board the 
Planck mission will acquire data at a rate much higher than the data rate allowed by the science 
telemetry bandwith of 35.5 kbs. The data are processed by an on-board pipeline, followed on- 
ground by a decoding and reconstruction step, to reduce the volume of data to a level compatible 
with the bandwidth while minimizing the loss of information. This paper illustrates the on-board 
processing of the scientific data used by Planck/LFT to fit the allowed data-rate, an intrinsecally 
lossy process which distorts the signal in a manner which depends on a set of five free parameters 
i^aver, r\, r^, q, G) for each of the 44 LFI detectors. The paper quantifies the level of distortion 
introduced by the on-board processing as a function of these parameters. It describes the method 
of tuning the on-board processing chain to cope with the limited bandwidth while keeping to a 
minimum the signal distortion. Tuning is sensitive to the statistics of the signal and has to be 
constantly adapted during the flight. The tuning procedure is based on an optimization algorithm 
applied to unprocessed and uncompressed raw data provided either by simulations, pre-launch 
tests or data taken from LFI set-up to operate in a special diagnostic acquisition mode. All the 
needed optimization steps are performed by an automated tool, 0CA2, which simulates the on- 
board processing, explores the space of possible combinations of parameters, and produces a set 
of statistical indicators, among them: the compression rate and the processing noise eg. For 
Planck/LFI it is required that Cr = 2.4 while, as for other systematics, eg would have to by less 
than 10% of rms of the instrumental white noise. An analytical model is developed that is able 
to extract most of the relevant information on the processing errors and the compression rate as a 
function of the signal statistics and the processing parameters to be tuned. This model will be of 
interest for the instrument data analysis to asses the level of signal distortion introduced in the data 
by the on-board processing. This method was applied during ground tests when the instrument was 
operating in conditions representative of flight. Optimized parameters were obtained and inserted 
in the on-board processor and the performance has been verifled against and the performance has 
been verified against the requirements, with the result that the required data rate of 35.5 Kbps has 
been achieved while keeping the processing error at a level of 3.8% of the instrumental white noise 
and well below the target 10% level. 

Keywords: (Cosmology): Cosmic Microwave Background - Submillimeter - Methods: 
numerical - Space vehicles: instruments. 
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1. Radiometer model and acquisition chain 

Planck/LFI [Bersanelli et al. (2009)] is based on an array of 22 radiometers assembled in 1 1 Ra- 
diometric Chain Assemblies (RCA) in the Planck focal plane. Each RCA has 4 radio frequency 
input lines and 4 radio frequency output lines, hence the number of radio frequency outputs to be 
measured by the on-board electronics is 44. Each feed-horn has 2 orthomode transducers extract- 
ing the two orthogonal components of linear polarization in the signal received from the sky and 
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8192 Hz 
Clock 



Figure 1. A schematic view of the main flow of scientific data for a single RCA of Planck/LFI. Each 
RCA has two detectors, but in this scheme only the first is represented and schematized. For graphical 
purposes the scheme represents just the first detector while connected to the reference-load, while Detector 
2 would be connected to sky. At a change of the Clock phase the two detectors will switch their connections. 
The block arrows represents the flow of digitized data and telemetry toward the spacecraft and the flow of 
telecommands from the spacecraft. 



feeding one of the radio frequency input lines of a radiometer, the other radio frequency input line 
is connected to a reference-load held at the constant temperature of 4.5 K. 

Each radiometer acts as a pseudo-correlation receiver [Villa et al. (2009)] measuring the dif- 
ference in antenna temperatures, Ar, between the sky signal, Tsky, and the reference-load Tioad, 
[Valenziano at al.(2009)]. However, given the sky and the reference-load have different mean tem- 
peratures the reference samples have to be scaled by a Gain Modulation Factor, r, which balances 
the difference between Tsky and Tioad to a mean[A7'] = so that 



Ar = rsky-rrioad. (i.i) 

A proper choice of r will allow near cancellation out most of the first order systematic errors 
[Mennella et al. (2003), Mehinold at al.(2009)], assuring in this way optimal rejection of system- 
atics, in particular drifts and the 1// noise [Mennella et al. (2009)]. As a first approximation it is 
possible to put 

meanfrsky] + Tnoise 

r~ — , (1.2) 

mean[rioad] + ^noise 

where Tnoise is the noise temperature. Eq. (1.2) makes evident how different values of r are needed 
in the various phases of the mission. In particular three cases are important: ground tests, in-flight 
cooling phase and finally in-flight operations with the instrument in nominal conditions. As an 
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exemple consider the case of the 30 GHz channel which is the least noisy channel of Planck/LFI 
having an expected Tnoise ~ 10 K. In on-ground testing the mean meanj^rskyj ~ mean[7'ioad] and 
sor^l ([Bersanelli et al. (2009), Mennella et al. (2009)]). In flight the mean[rsky] ~ 2.725 K but 
during the cooling the mean[rioad] varies from ^ 20 K down to the nominal mean[rioad] ~ 4.5 K. 
Thus the r varys from ^ 0.4 when the instrument starts to cool-down to ^ 0.88 at the end of the 
process when its nominal temperature is reached. With higher values of Tnoise the other channels 
will show smaller departures in their r from 1 as well as a lower sensitivity to the environmental 
conditions. 

To acquire sky and reference-load signals each radiometer has two separate radio frequency 
inputs, and correspondingly two radio frequency outputs, each one connected to a radio frequency 
detector and to an acquisition chain ending in a 14 bit analog-to-digital converter (ADC) housed 
in the Digital Acquisition Electronics box (DAE) [Bersanelli et al. (2009), Villa et al. (2009)]. The 
output of the DAE is sent to the Radiometer Electronics Box Assembly box (REBA) ' which pro- 
cesses the data from the DAE, of interpreting and executing telecommands, and of interfacing the 
instrument with the spacecraft Central Data Management Unit. This unit produces the scientific 
packets to be sent to the ground [Herreros et al. (2009)]. 

The DAE applies an individually programmable analogue offset to each input signal prior 
to applying individual programmablt gains and performing digitization. The contribution to the 
read-out noise budget from the ADC quantization is in general considered marginal. Appendix B 
discusses the case in which this hypothesis is no longer valid. The offset and the gain are adjustable 
parameters of the DAE and it is assumed that their calibration is independent from the REBA cali- 
bration [Cuttaia at al.(2009)] with an exception which is discussed in Appendix C. The ADCs are 
fetched in turn and the data are sent to the Science Processing Unit (SPU), a Digital Signal Pro- 
cessor (DSP) based computer which is part of the REBA [Herreros et al. (2009)] not represented 
in Fig. 1. The SPU stores the data in circular buffers for subsequent digital processing and and 
then applies the on board software pipeline to the data. In the process the 14 bit single samples are 
convert to 16 bits signed integers. The content of each ADC buffer is processed separately by the 
on-board processing pipeline and sent to ground. 

As usual in these kinds of receivers, the required stability of the radiometers is assured by 
switching each radiometer between the sky and reference-load. Thus each output alternatively 
holds the sky and the reference-load signal (or the reference-load and the sky) with opposed phases 
between the two channels. Hence, each buffer contains strings of interlaced sky — reference-load 
(or reference-load — sky) samples in increasing order of acquisition time, t i.e. 

T^ADC T^ADC 7. ADC 7. ADC ^1 on 

sky,r=0'^ load,(=l'^ sky,f=2 ' load,f=3 ' " " • ' ^ ^■•^> 

or 



7. ADC 7. ADC 7. ADC 7. ADC /, /.x 

^ load,;=0 ' ^ sky,/= 1 ' ^ load,r=2 ' sky,r=3 ' • " • " ^'^■^> 

The switching frequency is fixed by the LFI internal clock at 8192 Hz. The switch clock gives also 
the beat for the ADCs, which are then synchronized with the switching output, and it is sensed by 



'LFI has two redundant REBA units, but since they are perfectly equivalent in what regard the on-board data pro- 
cessing, all over the paper we will consider LFI as having one REBA only. 
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Figure 2. Schematic representation of the scientific onboard and ground processing for the Planck/LFI. 
Cyan boxes represent REBA operations, yellow boxes ground operations. Green pads specify the parameters 
needed by each operation. TOI could be produced both in undifi^erentiated form (Tsky, Tioad stored separately) 
or in differentiated form. 

the on-board processor, which uses it to reconstruct the ordering of the signals acquired from the 
ADCs and to synchronize it with the on-board time. This frequency also synchronises the ADCs 
with the input and is used by the SPU to reconstruct the ordering of the signals acquired from the 
ADCs and to synchronise them with the on board time. 

The data flow of raw data is equivalent to 5.7 Mbps; a large amount of data that cannot be 
fully downloaded to the ground. The allocated bandwidth for the instrument is equivalent to only 
53.5 kbps including all the ancillary data, less than 1% of the overall data generated by LFI. The 
strategy, adopted to fit into the bandwidth, relies on three on-board processing steps, downsam- 
pling, preprocessing the data to ensure lossless compression, and lossless compression itself. To 
demonstrate these steps, a model of the input signal shall be used. 

1.1 Signal model 

We describe quantitatively the kind of signal the pipeline has to process by modeling the output of 
the DAE as a function of time, t, as 



T'skyCO = 7'sky + ArskyCO + Msky, 
T'loadCO = 7'ioad + ArioadW + ^load- 



(1.5) 
(1.6) 
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where Tg^^, Tioad are the constant part of the signal. AT^ky, and Arioad a possible deterministic time 
dependent parts, representing drifts, dipoles, oscillations and so on, Wsky and nioad represents the 
random noise whose moments are cr^ cr^ and whose covariance is cr„ joad- 

The pipeline described in the following sections needs to be tuned to obtain a proper level 
of data compression which is largely determined by the covariance matrix of the signal whose 
components are 



'^sky = var[Ar,ky]+<ky (1-7) 
'^Ld = var[Arioad]+<ioad (1-8) 

0"sky,load = COV^AFsky, AFioad] + 0"„_sky,load (1-9) 

where it has been assumed that the random and deterministic parts are uncorrelated. It is useful to 
identify two extreme cases: the data stream is signal dominated, when var[ Arsky] + var[Arioad] » 

■^n sky ■'" load' "^^^^ Stream is noisc dominated, when var^ATsky] + var[Arioad] ^ cr^ sky 

cr^ j^^^. In the noise dominated case, the statistics of data will be largely determined by the statistics 
of noise, which in general could be considered normally distributed and uncorrelated over short 
time scales, given the l//-noise will introduce correlations over long time scales. In the signal 
dominated case the statics of data will be instead determined by the kind of time dependence in 
the signal. As an example if ITsky - ^loadl is large compared to the noise, while AJsky and Arioad 
are negUgible, the histogram of the signals will resemble the sum of two Dirac's delta functions 
6(x - Tsky) + S(x - Tioad) convolved with the distribution of noise. 

If a linear time dependence of the kind AT(t) = Af + C is present the distribution of the samples 
will be uniform, and bounded between T + At/2, where r is the time interval relevant for the signal 
sampling. The variance will be A^/12 where A^- = At is the drift amplitude over the time scale t. 
The signal could be considered noise dominated if r < ^/T2o■/\A\. From the point of view of data 
compression, in determining whether a signal is noise dominated or not, the critical factor is the 
time scale t. For our coupled signals, denoting with Agky and Aioad the drift rate in the sky and 
reference-load signals, and with Asky.r, ^ioad,T the relative amplitudes, the relevant components of 
the covariance matrix will be 



A^ 

var[Ar3ky]^ = ^ (1.10) 
A^ 

var[Arioad]r = "yI^ (1.11) 

cov[Anky,Ar.oad]^ = ^^'^';^'"''" (1.12) 

More complicated distributions may occur for a polynomial time dependence of the kind AT oc 
f", or for a sinusoidal time dependence of period P: AT oc sin(2nt/P), but in most cases a simple 
linear drift Ar oc t could be taken as a reference model given that non periodic drifts are bounded 
in amplitude by corrective actions commanded from the ground station, while periodic variations 
have periods much longher than the time span of a packet. Also in general it is assumed that the 
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mean meanj^ArskyJ = and the mean[A7'ioad] = but it is interesting to discuss even the case in 
which this will not be strictly true. 

1.2 Data compression and on-board processing 

The strategy adopted to remain inside the downlink bandwidth is based on three processing steps: 
i) signal downsampling, ii) signal conditioning and entropy reduction, iii) loss-less compression 
[Bersanelli et al. (2009), Miccolis (2003)]. A schematic representation of the sequence in which 
these steps are applied on-board and whenever possible reversed on-ground is given in Fig. 2. The 
figure refers to a single radiometer chain and it is ideally splitted into two parts: the upper part 
depicts the on-board processing with cyan boxes denoting the main steps. The corresponding on- 
ground processing is depicted in the lower part with the main steps coloured in yellow. Green pads 
represents the processing parameters. The first four of them are refered to as REBA parameters, 
and they are applied both on-board and on-ground. The parameters are: the number of ADC raw 
samples to be coadded to form an instrumental sample, A^aver, the two mixing parameters ri, r2, 
the offset O to be added to data after mixing and prior to requantization, and the requantization 
step q. The exact meaning of each of these parameters will be explained later in the text, when 
each step will be explained in full detail. The important thing here is to recall that the on-board 
parameters are imposed by telecommands sent from the ground. They are copied in each packet 
earring scientific data and on-ground they are recovered from the packets to be applied by the 
on-ground processing. The r factor is a parameter of the ground processing and is computed from 
the total power data received on the ground. The final products in the form of Time Ordered Data 
(TOI) either in total power or difi"erentiated are stored in an archive represented by the light-blue 
cylinder. 

Before entering into the details of the various steps it has to be noted that in principle a factor 
of two compression would be immediately gained by directly computing the difi"erence between 
sky and reference-load on-board, i.e. sending at Earth differentiated data. Although on-board 
differentiation seems straightforward ^ it implies at least a couple of major disadvantages. First, 
once the difference is made, separate information about the sky and the reference-load is lost, 
preventing an efficient detection and removal of other many second order systematics. Second a 
set of 44 r factors could be in principle easily uploaded on-board and applied to the data, but the r 
for each detector has to be fine-tuned on the real data. This would mean that the optimal r should 
be continuously monitored and adjusted to avoid uncontrolled drifts for each radiometer, but this 
is inpractical, having just 3 hours of connection per day. In addition, an error in calibrating the r 
will cause an irremediable loss of data. Therefore, the best solution is to downlink the sky and the 
reference-load samples separately allowing the application on the ground of the optimal r. 

1.3 Downsampling 

Each sky sample contains the sky signal integrated over a sky area as wide as the beam, but given 
each radiometer is sampled at a frequency of 8192 Hz the sky is sampled at an apparent resolution 
of about 1/2 arcsec. On the other hand the beam size for each radiometer goes from 14 arcmin for 
the 70 GHz to 33 arcmin for the 30 GHz. Consequently it is possible to co-add a number, A^aver^ 

^This was the baseline of the on-board processing for [Maris et al. (2000), Maris et al. (2004)]. 
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of consecutive samples producing averaged samples whose sampling time correspond to a more 
reasonable resolution without any loss of information. 

The value of A^aver depends on the beam-width, b^d, for the given detector 



'^spin^ovei-Sin^ 
^radysampling 



(1.13) 



Wspin [rad/sec] is the rate at which the satellite spins about its spin axis [The Planck Bluee Book (2005), 
Dupac, Tauber (2005), Maris et al. (2005)], /? is the boresight angle between the telescope line-of- 
sight and the spin axis, and Mover = 3 is the the number of samples per beam. Nominal values for 
the A^aver are 126, 88, 53 respectively for the 30 GHz, 44 GHz and 70 GHz frequency channels. 
The corresponding sampling frequencies in the sky are then 65 Hz, 93.1 Hz and 154.6 Hz, while 
samples are produced at a rate twice the sampling frequency. This drastically reduces the data rate 
that becomes about 85 kbps without introducing an important loss in scientific information. 

The output of the downsampling stage can be seen as a sequence of sky — reference-load 
couples ordered according to the generation time t 



' sky 



^load 7^. 



:0 



sky 
T'load 



' sky 



t=S, WloadJ,^ 



■-2S, 



Tsky 
T'load 



t=nd, 



where 6t = 2Afaver //sampling the samples are interlaced to generate a string of time ordered samples 
as 



-* sky Q5 load,0' sky load,^^? sky 2S;^ load,2^f5 ...,-/ sky ^j^^? -* load,n6t^ ' " " 

in a manner similar to the output of the ADC. But, while sky and reference-load samples in each 
ADC output buffer are consecutive in time, this is no longer true for the downsampled values. As 
an example, assuming a sequence from the ADC where the even samples are T^'^ and the odd 
samples are T^^^ (i.e., Tsky, 7'ioad sequences), then any Tsky^ will be the sum of A^avei samples with 
times between f and t + 2(Nayer - 1 ) //sampling while and Tioadr will span the time range t+l //sampling 
and ? + 2Afaver //sampling- While this small time shift is not very important while observing sky 
sources it might be thought to be relevant while attempting to correlate the observed signal with 
internal sensors, such as those used to determine the level of perturbation introduced by the active 
cooling. However, this problem is probably more theoretical than real as internal temperature 
variations do not occur on very small timescales. For simplicity, unless needed, in the remainder 
of the text we will omit to specify time in our formulas. 

1.4 Lossless compression, packeting and processing error 

The coding for the data stream in output to the compressor has to be optimized by taking into 
account the statistical distributions of the symbols in the input data stream. For this reason lossless 
compressors maintain an internal representation of the data distribution, such as the histogram or 
similar statistical indicators. In our case the selected compression scheme is based on a 16-bit, zero 
order, adaptive arithmetic entropy encoder [Herreros et al. (2009)]. The compressor assumes the 
data stream is represented by an uninterrupted list of couples of 16-bit integers. It does not take any 
particular interpretation of the content of the samples or of the order in which they are presented. 



-7- 



Simply it continues to code and store the data in a packet up to when the maximum packet length 
is reached. After that the packet is then closed and a new packet is opened. In establishing how to 
code the data, the compressor uses an adaptive scheme to decide the best coding for the input data 
as they are produced by the previous steps of the on-board pipeline. On ground the decompressor 
extracts the samples from each packet in the same order in which they have been introduced by the 
compressor. In this sense the couple compressor/decompressor act as a First In - First Out device, 
and becomes nearly transparent in the scientific processing of the data. 

The basic requirement for the packets produced by the compression stage is that of packet 
independency i.e. it must be possible to interpret the content of each packet independently of all 
the others. For LFI it means that the pipeline in the ground segment shall be allowed to generate 
from each single packet chunks of differentiated data. So the compressor has to store consecutive 
couples of sky — ^reference-load samples within each packet together with the information needed 
by the decompressor to interpret the compressed packets. In addition the compressor has to be able 
to self-adapt its coding scheme to the statistics of the input signal, without the need of any prior 
information on it. Finally, the compressor has to be fast enough to allow real-time elaboration 
of data with limited memory consumption. These requirements suggest the use of a compression 
scheme in which the compressor updates its internal statistical table each time it receives a sample. 
An empty statistical table is then imposed at beginning of the compression of a new packet assuring 
complete independence. When a symbol not present in the table is received as input a pseudo- 
symbol corresponding to a "stop message" is issued followed by the uncompressed new symbol, 
after that the internal statistical table is updated. If the symbol is not new the statistical table is 
updated and the symbol is compressed accordingly. On ground the decompressor starts with the 
same empty internal statistical representation assuming the first symbol is a stop followed by a 
new symbol, and updating accordingly its internal table as it receives symbols to decode or stop 
symbols. 

The efficiency of a compressor is typically measured by the, so called, compression d rate: 
the ratio between the length of an output string Lout derived from the compression of an input string 
of length Lin 

Cr=—. (1.14) 



Cj^' = ^. (1.15) 

^data 

It is well known that any lossless compressor based on entropy encoding has an upper Umit for the 
highest compression rate 

C7h = ^, (1.16) 
H 

where Ambits is the number of bits used for coding the samples and H is Shannon's entropy for 
the signal, which in turns depends on its probability distribution function (PDF). For an optimal 
compressor the theoretical for a digitized signal represented by integers in the range Qmin < 
Q < Gmax is given by 



-8- 



Cjh = (1.17) 
H 

Qmax 

fi = -Yj fQ^'^^^fQ- (1-18) 
e=emi„ 

where H is the Shannon entropy for the data stream, fg is the frequency by which the symbol 
or value Q occurs in the data stream, having limyg^o/glogi/G - 0> ^nd Juq=q Iq - ^- Non- 
idealities in the signal and in the compressor cause the effective Cr to be different from the expected 
Cj^ having Cj^ > C^. Usually this is accounted for by scaling C^^ by a multiplicative efficiency 
factor Tj. However its exact determination is a complex task described in some detail in Sect. 2.4 
and for the time being we will neglect it. 

From Eq. (1.17) and (1.18), to maximize d- we need to minimize for the input signal, forcing 
the reduction of its variance by requantizing the data. I.e. dividing the data by a quantization step, 
q, and rounding off the result to the nearest integer 

e = round (^i^j, (1.19) 



where O is an additive constant usually defined by asking 

mean[Z + (9] =0. (1.20) 
On ground the data are then decompressed and reconstructed by multiplying them by q. 

X = q[Q-0]. (1.21) 

Some information is lost in the process resulting in a processing distortion, e^, which in the simplest 
case is approximated by 



VT2 



(1.22) 



In [Maris et al. (2000), Maris et al. (2004)] we studied the case X = AT, there it had been shown 
that for Planck/LFI the statistics of the differentiated data stream was approximated by a nearly 
univariate normal distribution with cr = rms[A7'], and that after re quantization and reconstruction 
both Cj^ and where largely parameterized by the cr/q ratio with 



C, 



Th 



cr 



Ambits 
log2(V2^|)' 

-(- ' 



(1.23) 
(1.24) 
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1.5 The mixing algoritlim 



In general a data stream made of alternate sky and reference-load samples can not be approximated 
by a normal, univariate distribution. Two different populations of samples, with different statistical 
properties are mixed together. In this case the Ci could be reduce with respect to the univariate 
case. Furthermore, most of the first order instabilities, such as drifts and 1 //-noise, come from the 
radiometers, producing spurious correlated signals in Tsky and Tioad- So undifferentiated time-lines 
for Tsky and Tioad are much more unstable than the corresponding Ar timelines further reducing C^. 
In particular, fast drifts may rapidly force the compressor to saturate the packet filling it with the 
decoding information, in the worst case resulting in Q < 1. According to Eq. (1.23) It is possible 
to increase q to keep Cr within safe limits, but the log2 dependence will drive e^/cr to rapidly 
grow towards e^/cr^l. Alternatively a more complex compression scheme, one taking into account 
the sky — reference-load correlation could be implemented. But this would be computationally 
demanding and would increase the amount of decoding information to be placed in each packet. 

One is left with the need to recover the advantage of differentiated data, i.e. reduced instabil- 
ities and more homogeneous statistics, without losing the opportunity to have sky and reference- 
load separately on ground. The adopted solution is inspired by the principle of the pseudo- 
correlation receiver. Instead of sending to ground (rsky,rioad) couples, LFI delivers {T\,T2) cou- 
ples where each Ti, T2 is an independent linear combination of the corresponding Tsky and Tioad- 
Couples are then quantized and compressed. On ground the data are decompressed, dequan- 
tized recovering the linear combinations which are then reversed, recovering the original data 
[Miccolis et al. (2003)]. The most general formula for the linear combinations is 



here the matrix, M, in Eq. (1.25) is named mixing matrix (actually it represents a mixing and a 
scaling unless |M| = 1), its inverse M~' is the corresponding de-mixing matrix. The demixing 
matrix is applied on ground to recover the string of (T^sky, Tioad) out of the received string of (^i , T2), 
this imposes |M| 0. The structure of M determines the kind of coding strategy. A particular 
structure for M could better fit a given subset of constrains rather than another. Both the Cr and 
are determined by q as well as M. In particular it is obvious that the processing distortion will have 
the tendency to diverge for a nearly singular M. A detailed analysis of the whole set of possible 
structures for M is outside the scope of this paper, but in general M shall be optimized in order to 
i) equalize as much as possible the T\ and T2 statistics, ii) reduce as much as possible the effects of 
first-order drifts, iii) maximize the Cr, iv) minimize e^. For Planck/LFI the following form for M 
has been selected. 




(1.25) 



M 




(1.26) 



|M| 



r2-n\ 



(1.27) 



1 (r2, 
r2-r\ \ 1, 



-1 J 



(1.28) 
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which is not completely optimal, since it allows optimization only on a subset of possible cases, but 
has the advantage of having a reduced amount of free parameters to be uploaded for each detector ^ 
and it is directly suggested by Eq. (1.1). Given in this case |M| = r2 -ri It is evident how for a fixed 
q the distortion will increase decreasing |r2 -r\\. In nominal conditions the mean meanj^rsky] = 
2.735 K, the mean[rioad] = 4 iT, and a possible choice for r\ and r2 is ri = 1, r2 = r = 0.85. But a 
tuning procedure is required to determine the best parameters for each radiometer. 



a) b) 



Irteriaced Time Series 




sample rumber sampie rumber 




Figure 3. The effect of mixing and interlacing on time series (top frames) and distributions (low frames), in 
the output to the radiometer. Frame a) is an input time series, for sky (red dashed-line) and reference-load 
(blue dashed-line), and the corresponding mixed quantities gi (red full-line), Q2 (blu full-line) after mixing 
with ri = 3/4, /•2 = 1 /2. The range of values allowed by noise within ±5cr are represented in both mixed and 
not mixed quantities by the upper and lower dotted lines. The input time series has identical drifts on sky and 
reference-load equivalent to several noise crs and corresponds to a signal dominated time serie as defined in 
Sect. 1.1. Frame b) are the signals as seen from the compressor after interlacing, full-line without mixing, 
dashed-line after mixing. Frame c) is the effect on a normal distribution and on a ramp, green before mixing 
and red after mixing. Frame d) shows the effect on the projected distributions after interlacing. 



^Packets independency imposes that all the free parameters {N-Ma, q, O, r\ and r2) have to be stored within each 
packet. 



Mixing transforms the bi-variate PDF which is for Tsky, Tioad signals into that for Ti, T2. 
Fig. 3c represents its effect on the bi-variate PDF for the noise and the drift. Looking at the 
normal distributions of randomly variable signals in Tsky and Tioad, ^(T'sky. Txoai)^ the line for which 
giTsky, Tio!id)/g{Tsky, T'load) = 1 /2 is a dashcd line, and the equivalent line for or g{T\,T2) is a full- 
line. The distribution for the deterministic signal (either a ramp, a drift or a triangular wave) is 
instead represented by a segment, plotted again as a dashed line to denote the Tsky, Tioad signal, and 
as a full line to denote the Ti, T2- The effect of mixing is a combination of a non-uniform scaling, 
a rotation and a shift. The circle transforms into an ellipse. The line changes its tilt and length. 
In other terms, the covariance matrix for mixed data will be different from the original ones. A 
very interesting consequence is in the case of a normally distributed noise a correlated noise will 
appear in the mixed space even if the input noise is not correlated. In general after mixing the 
major axis of the two figures has the tendency to align with the x = y line and the center of the two 
figures shifts. In this case |M| = r2-r\ = \ and the size of the two figures changes proportionally to 
|M|. Interlacing transforms the bi-variate PDFs into univariate ones. Fig. 3d represents the effect 
of mixing on the PDF of interlaced data. Again dashed lines represents the distributions before 
mixing and the full-lines after mixing. As in Fig. 3b red indicates the Tioad or Ti and blue Tsky 
or T2- The bottom part of Fig. 3d represents the resulting distribution of interlaced signals, before 
(dashed) and after (full) the mixing. 

How these distributions have to be decomposed in terms of the projected distributions is shown 
in the top part of the Fig. 3d which shows separately the distributions of the random and determin- 
istic components, respectively a normal and a box distributions for T^-^y and Tioad; The resulting 
distribution will be the convolution of the two, which for Tsky and Tioad are very similar to a box 
distribution. Of course the drift makes the overall signal non Gaussian, in particular for Tioad- The 
central part of Fig. 3d is the equivalent for the mixed signal. Here the drift is reduced and the 
convoluted signals are more similar to the original normal distributions of noises. I.e. mixing not 
only reduces the distance between the two components but, by reducing the drift, make them more 
normally distributed. 

After mixing the (T\,T2) couples are re-quantized giving the quantized couples (Qi , Q2) which 
are interlaced and sent to the compressor 



where O is an offset added to force (Q\,Q2) to stay within the range [-2'^+2>5] 4. On ground 
packets are entropy decoded, the data streams are de-interlaced and the corresponding (Qi, Q2) are 
used to reconstruct the sky and reference-load samples 

''As anticipated in Sect. 1.3, to reduce the roundoflf error, tlie division by -/Vavci- is applied generating (Q\,Q2), in 
addition the parameter for digitization is not q but Sq = Ijq, so that Eq. (1.29) shall be written 



however for consistency with [Maris et al. (2000), Maris et al. (2004)] in the following we will omit the division by 
Waver and we will continue to use q in place of 5q. 




(1.29) 




(1.30) 
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Ta=Yj^~a]MQi-0)\, a = sky,load, (1.31) 

1=1,2 

where M~\ are the components of M"\ and the "x" is used to distinguish a reconstructed quantity 
out of a processed one x. 

Mixing will map sky — reference-load statistics in the corresponding mixed statistics 



Ti = Tsky - nTioad, «■ = 1 , 2; (1 .32) 

ATi = ATsky - nATioad, i=l,2; (1 .33) 

^ = ^ky + ^^Ld - 2'-,0-sky,load; (1-34) 

^^1,2 = f^sky + ri r2crl^ - ^^^-^crsky,ioad- (1 .35) 

A simpUfication for the components of the covariance matrix can be obtained by assuming osky 
as unitary, then defining fa = Ta/o-^^y, a = sky, load; and incorporating Rg- = CTsky/crioad in the r,- 
factors giving in these normalized parameters 



Ar = Tsky-rrioad (1.36) 
&] = \+f]-lhQs\\ (1.37) 

<T?2 = l + nrz-^^^si. (1.38) 

where = crsi£yjoad/(o"skyO"ioad), r, - rilRo-. The corresponding transforms for the expectations is 
more complex. Of course Ta = - sky, load, but 



Ti = f^i^y-Ro-ntioad, i = 1,2; (1.39) 
Af i = Afsky - /?<rr,- Af load, J = 1 , 2. (1 .40) 

(1.41) 

However these normalizations are very useful in discussing the compression rate especially after 
defining the obvious q = ^/ctsIq,. 

1.6 Modelling the statistical distribution of processed data 

We want here to define an approximation able to asses Cr > Cj^ in a simple way. For this reason 
we need to model the entropy for the signal entering the compressor. Of course the accuracy to 
which it is possible to predict the final Cr is directly connected to the accuracy to which the entropy 
is predicted. In the following we present two approximations for the entropy of the signal. A lower 
accuracy approximation and a high accuracy approximation. 
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1.6.1 The low accuracy approximation 

Considering the usual reference cases of a noise dominated signal and a signal dominated by a 
linear drift, in the first case the PDF may be approximated by a normal distribution, in the second 
case the PDF may be approximated by a uniform distribution with fQ = qlA values and a = AI Vl2, 
but in all the cases for ajq » 1 

/f = log2V-' (1-42) 
Q 

with fcpdf a constant depending on the type of p.d.f. ranging from VT2 for a uniform distribution to 
^2ne for a normal distribution. The difference in H between these two extreme cases is 0.25 bits. 
The argument of the logarithm is the number of symbols in the distribution. So H may be written 
also as H = log2A'3^j, with A^^^j, = k^fO-jq. Of course in the case of the uniform distribution 
^^^mh ~ ^symb- The PDF for the interlaced signals gives the probabiUty to have a symbol Q either 
from processes Qi or Q2. Then 

nO) = (1.43) 

with PiiQ), / = 1, 2 the marginal PDF for the Q, drawn from the bi-variate PDF 'P(Qi,Q2)- For 
our extreme cases both Pi(Q) are uniformly distributed or normally distributed according to the 
original PDF from which they are drawn ^. This allows one to neglect, in estimating the entropy 
of the interlace signal, their mutual correlation. Then the entropy for the interlaced data is just a 
fimction of the RMS for the two distributions cri, cr2, and their separation, Adistr, 

2 E[f2l-Effil 

Adistr=7 . . (1-44) 

Kpdf CTi + 0-2 

which is a normalized measure of the distance between the two peaks. After some algebra 

A _T n-r2 Tioad 

Adistr = 2— — (1 .45) 

ACpdf 0-1+0-2 

Then the entropy will be just a function of (Xi , 0-2 and Adistr- An exact analytical expression 
for H can not be obtained for this case. However, it is easy to see that in the limit |Adistrl » 1 the 
entropy takes the limiting value 

H^ = ^i±^ + 1, (1.46) 

giving 

Hoc = log2(fepdf) -1- log2( ^d-i&i) - log2 q+l. (1 .47) 

On the other side if Adistr = and &i = &2 the two PDFs collapse giving Hq = Hi= H2. In all the 
other cases Hq < Adistr) < ^^oo. The important point here is the assumption that 



'In this case the central Umit theorem does not apply to the signal with a uniform PDF given its deterministic nature. 
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Figure 4. Iso-countour lines of the left side of Eq. (2.52) for psi = -1 (left), Qsi = (center) and gsi = 1 
(right). In the first case the function is minimal in the (-1,-1) point, in the second in the (0,0) point, while 
in the third in the (1,1) point. 



H^Ho,, (1.48) 

would never overestimate the entropy by more than 1 bit or ^ 30%, so that neglecting the compres- 
sor inefficiencies a sufficient condition to asses Q > cJ^' would be 



withfftgt=A'bits/c7°\or 



/^co</^tgt (1-49) 



^J^-l^-2 < q—. ; (1.50) 



and so 



^{l +f\- 2rifti)(l + fl - 2r2fti) < q— ; (1.51) 

Eq. (1.48), Eq. (1.49), and the derived Eq. (1.50) and Eq. (1.51) represent our low-order approx- 
imation for the optimization of REBA parameters. In particular Eq. (1.51) puts a lower limit to q 
(and q) for a given Q, in fact for ri = r2 = 0, § must be larger or equal to 

^..C^') = ^H^T- (1-52) 

§opt = ^minCCj^') ^{l + r\- 2ri^si)(l + - ItiQa) (1-53) 

from which ^opt is simply derived as ^opt = o"sky§opt- This equation does not constrain completely 
^opt and for this reason we have to take into account the processing error as explained in Sect. 1.7. 



1.6.2 The high order accuracy approximation 

The accuracy by which ^opt is determined by Eq. (1.53) is solely determined by the accuracy of 
imposing H = Hex,. In itself, given the statistics of the input signal it would be not a problem to 
calculate by numerical integration H as a function of ri, r2 and q, but of course this would be 
quite expensive from a computational point of view. For this reason in Appendix A is derived a 
high accuracy algorithm to compute H based on simple equations from which ^opt could be readily 
obtained. However from the conceptual point of view the high accuracy method does not introduce 
any new detail in the discussion, the remaining part of this section refers only to the low-accuracy 
method unless otherwise stated. 

1.7 Processing error of the mixing/demixing algorithm 

The most important way to quantify the processing error is the measure of the distortion in the 
undifferentiated or differentiated data. The statistics of such distortions are taken as metrics of the 
quality of the process. For the undifferentiated data 



6q^ — Tq, T(y, 



(1.54) 



a = sky, load. By following the methods of [Maris et al. (2004)] from Eq. (1.31) and Eq. (1.54) it 
is easy to derive the covariance matrix of the quantization error, E^^afi = cov^6a,S/s^ 



_ 1 

12 (r2-r02 



/ 9 9 



r\+r2, 2 

The distortion of differentiated data is instead expressed by 



(1.55) 



Sm = (fsky - ffioad) - (T'sky " ''T'load); (1 .56) 

where r is the r determined on the processed data, which in general will be slightly different from 
the r determined on the original ones. However, assuming f ^ r from Eq. (1.55) the variance of 
Sm is 

2 _ (r2-r)2 + (ri-r)^ 

The first important fact which has to be stressed is that the variances of both errors are pro- 
portional to q^ - r\)^. Of course a nearly singular matrix with r2 ~ r\ will result in very large 
errors. In addition, Eq. (1.55) shows that, despite quantization errors for Q\ and Q2 are uncorre- 
lated, application of demixing causes processing errors in Tsky and Tioad to be correlated unless 

n+r2=0. (1.58) 

However, expanding the numerator of Eq. (1.57) produces e^^^^ocr^ + r^+lr^ -2r{r\ +r2) suggest- 
ing the important result that a not null correlation in the quantization errors may lead to a reduction 
of the error distortion in the differentiated data 

Another very important case is that in which either r\ = r or = r in this case Eq. (1.57) reduces 
to 
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Figure 5. Example of analysis of Qack factors in the (ri,r2 - n) space. Various cases for different values 
of Tsky and Tioad are considered. The yellow region is the allowed region when all the Q^ckj < 1 , the blue 
region is the forbidden one. Thin dashed lines are the limits of allowed regions (3ack,i with positive (red) 
or negative (yellow) drifts and Q^ck,! for positive (green) or negative (violet) drifts. Thin full lines are the 
allowed regions for (3ack,i and Qack,2 whose intersection is marked with a white thick line. Values of Tsky, 
cTsky, ATsky and the corresponding for Tioad are between parentesis in the title of each frame in the order 
(mean, sigma, drift). 



1.8 Saturation 

A further source of processing error is the saturation of the dynamical range of the data format used 
to process the data, or simply Saturation. 

Saturation occurs when the argument of the y = round[x] function exceeds the maximum 
range of values allowed by the computer to represent our results. Typically what happens is that 
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a) 



b) 



Sky(-2,1,D);Load(2,1,0) 



Sky(-20,1,25);Load(20,1.25! 



-2 D 2 



-4-2 2 



c) 



d) 



Sky(-20,1,D);Load(2D.1,25) 




Sky(-20,1,25); Load [20,1 ,-25) 



r 



-4-2 2 



-4 -2 2 4 6 



Figure 6. Example of analysis of (Sack factors. See caption of Fig. 5 for explanation of symbols. 

y = round[x] returns an Ambits signed integer. If \x\ > 2^'^'^"^^ an overflow or an underflow wifl occur. 
Depending on the implementation of the y = round [x] function the value of y could be either forced 
to be ±2^^ with the sign depending on x, or modular arithmetic could be applied so that as an exam- 
ple a too large x > could be mapped into y <0. In all cases the whole subsequent reconstruction 
will produce meaningless results. So it is fundamental to avoid saturation. 

For this purpose we define as a quantitative index for the saturation the instantaneous Qack 



ratio 



Qack,/(0 = 7,1 =1,2. 

Saturation occurs if at some time \Qack.M\ - 1 and the non-saturation condition is 



(1.60) 



iack,l(f)l<lA|ack,2(0l<l; Vf (1.61) 

in general this will put limits on ri, r2, q and O. Assuming to have applied the optimized offset of 
Eq. (2.1) the linear combinations are 



From Quantization Alarm Check. 
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AT,ky(t) - riA rioad(0+ V^rioad + n(T„,i 



^ack,l(f) = ^ ^ (1.62) 



^-k,2(0 = (1-63) 

where fi ^ 5 is used to assess a safety region against random fluctuations. 

In computing Q^ck the eff'ect of mutual cancellation of extremal values must be considered. A 
conservative estimate would be to propagate the modulus of each variation 



maxCAr,) = max lAr^kyl + max lA^ioadl + 



ri-ri 



\Tio,d\ + ncri, (1.64) 



2 

with min(Ar,) = -max(Ar,), but it is better to explore the various combinations of minima and 
maxima within Eq. (1.62), producing a set of partial Qack indexes which have to be independently 
satisfied. An example of such method is illustrated in Fig. 5 and Fig. 6. 

In general, the separation between Ti and T2 is a function of time, whose measure is given by 
the divergence Vr, a parameter just sensitive to AT^ioad and Tioad 

^^..x ,(r2-ri)(Arioad(f) + 

VT{t) = -2 ; — ; (1.65) 

Kpdf(0"l +0-2) 

of course \VT\ will be constant when AFioad = 0. 

To determine the region of parameter space rj, r2 which satisfies Eq. (1.61) it is most conve- 
nient to work in the (ri , r2 - ri ) space, there the most general condition is 



ri(a—l)-b <r2 — ri< r\{a-\)-c\ (1.66) 
c-{r2-ri){a-\) <(fl-l)ri < b-{r2-ri){a-\); (1.67) 

with the dimensionless coefficients 

/2Arioad ,\ ^ON 

a = \— + 1 ; (1.68) 

V ^load / 

b± = J— (Arsky + ^2^"*-' ± no) ■ (1 .69) 

T'load 

c± = J— (AT-sky - ^2^^«^-> ± no) ■ (1 .70) 

7'load 

cr = max(cri,cr2); (1.71) 
For M = those conditions define a diamond-shaped region whose vertices are 
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Note that B : and D : lie on ^2 - ri =0 line, while A : and D : are above and below it, the exact 
ordering depending on the signs. The center of the diamond-shaped region is locate on ((b + 
c)/(a- 1),0). If i> + c = the region is centered on the origin of the Cartesian system, A :, C : 
and fi :, Z) : are mutually opposed. In the case a = 1 the region degenerates into a band parallel to 
^2 - ri = 0, and bounded by -b <r2-ri < -c. 

First considering the stationary case, having Arsky = 0, Arioad = 0, then a = \, b = -c the only 
constraint for n = in this case Qack.i and Qack,2 defines the same condition which is 

|r2-ri|^<^7V.at. (1.72) 

identifying simply a band in the space center around the r\ = r2 line. But the effect of noise has to 
be considered i.e. « > so that now there is a difference between a+ and fl_ so that Qack.i defines a 
vertical band and Qack,2 a diagonal band whose intersection is the diamond-shaped region above, 
see Fig 5a. Changing the ratio between cr„ 2/cr„ j will not change the shape but just the size of 
the region. Fig 5b. Changing T^ky and Tioad instead will change the shape of the allowed region. 
Fig 5c, Fig 5d and Fig 6a. If Tioad = the saturation condition becomes |1 - r,j • \A(t)\ < qNsm i.e.: 
1 - ^^'satMmax < < 1 + ^A^satMmax. with A^ax = max(|A(OI). This defines a rectangular region 
with diagonal ri = r2. In the limiting case for n = 0, A^ax — > the allowed region becomes the 
whole plane, while in the opposite case -Ajyiax — ^ *^ allowed region shrinks toward ri — ^2 — 1- 
Perturbations in the sky channel, such as the cosmological dipole, introduce a fluctuation which 
affects just the sky, in this case Arjoad = and a= 1. Even here the simplest case is Tioad.o = 0, 
or the forbidden ri = r2 in that case max{\D{t)\) < ^A^sat is a sufficient condition which puts a 
limit just on q. In the most general case from Eq. (1.66) the limits of the allowed region are 

-2Arsky/rioad,0 - 2<jA^sat/7'load,0 < ''2 " ''l < -2A7'sky/rioad,0 + 2gA^sat / T'load.O 

Other effects, such as instabilities of the 4-K reference-load, could affect just Tioad in this case 
Arsky =0, b = 2^A^sat/7'ioad,o = -c cvcu here the simplest case is that in which Tioad = 0, in that 
case max(|A(f)|) < ^A'sat is a sufficient condition which puts a limit just on q. In general a diamond- 
shaped region symmetrical around the origin represents the allowed region. This is represented in 
Fig 6c. 

Drifts in the gain of the amplifiers in the radiometers, such as those produced by thermal effects, add 
correlated or anticorrelated signals in sky and reference-load. So the model case to be considered 
is the one in which A(t) ^ A7'sky(0 ~ A7'ioad(0- In this case with a = (b + c)/2 and Eq. (1.62) and 
Eq. (1.65) define the regions in Fig 6b and Fig 6d. 

It is interesting to consider the range of values assumed by Pi and P2 with A in the range 
Alow < A < Aup then \Pi,up- Pi, Lov/\ = l^up-Aiowl • |1 -ri\. So the condition to have identical ranges, 
is ri t r2 and r2 + ri = 2 or (r2 - ri) = 2 - 2ri 

It has to be noted that in nominal conditions, Tsky.o ~ 2.73 K, Tioad.o ~ 4 K while Tsky, Tioad 
fluctuations are expected at the level of at most some 10"^ K giving Arioad/T'ioad.o ~ Arsky/rioad.o ~ 
somex 10"^, and hence a~ I, b ~ -c. However outside this case, in particular during testing and 
the cooling in the transfer phase these conditions could be severely violated. 

2. Optimizing the On-board Processing 

The optimization of the algorithm consists in determining the "best" combination of the set of 
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processing parameters i.e. the "best" n-tuple A^aver- ri, ¥2, O, SqOV q. It is mandatory that the 
optimization procedure will keep within safe limits d = cl^. 

The classical approach would require a function of merit and a searching algorithm through the 
corresponding NxR'^ parameters space to be applied to each of 44 detectors. However a reduction 
of the cardinality of space comes from the fact that, by-requirement, the nominal A^avei is fixed by 
the oversampling factor for the beam, so apart from the cases in which a different oversampling is 
required, the A^aver in nominal conditions is fixed. The only cases in which A^aver could be varied 
are: i.) sampling of planets for beam reconstruction; ii.) ground testing and diagnostics. The 
first case occurs when the beam has to be reconstructed with higher detail than the one reachable 
with the nominal oversampling factor Wover = 3. So it is possible to ask the on-board processor to 
decrease A^aver increasing proportionally the data-rate from the feed-horns which will be affected 
by a planet. To arrange the higher-throughput of scientific telemetry, q will have to be increased, 
increasing proportionally e^. In the second case the value of A^aver could be varied either to increase 
the time resolution, as an example if sampling of some perturbation characterized by time scales 
compatible to A'aveiV/sampiing has to be investigated, or if some temporary shortage in the telemetry 
rate is imposed, asking in this case to increase A^aver- Also while testing on ground for long term 
drifts, the sky is replaced by a dummy load at constant temperature. In this case time resolution is 
no longer an asset and A^aver could be increased. A further reduction of the parameter space to be 
explored comes from the fact that usually O is optimized in order to have mean[7'interiaced +<5] = 
where Tinteriaced ^rc the interlaced samples produced after by Eq. (1.26). It is then easy to derive 
thatmean[rinteriaced+<5] =mean[ri] + mean[r2]+2C) so that ^optimal = -(mean[7'i] +mean[r2])/2, 
and with some simple algebra 

<5optimal = ~Tsky "I ^ T'loadj (2.1) 

where the mean has to be computed over a suitable time span. What remains is a %^ parameter 
space to be explored {ri,r2,q). 

2.1 Target function 

The target function ;^^(rl,r2,^) for the optimization would i.) asses Cr = cj^' to be kept within safe 
limits; ii.) asses to be kept as small as possible; iii.) asses additive constrains. These constrains 
do not allow a unambiguous definition of a target function. As an example, even for a stationary 
signal dominated by white noise, Cr computed on each packet is a random variable. So the question 
is whether = cj^^ has to be interpreted strictly, i.e. forcing each packet to have Cr = cj^^ or on 
average leaving space for lower and higher Cr? In general it would not be critical if some fraction 
of the packets would be compressed at a rate lower than Cr The requirement on is even worse 
defined. Of which €q are we speaking? As shown in sect. 1 .5 it is evident that there is not a general 
definition for e^. Depending on the scope of the data acquisition it could be more interesting to 
have a low for Tsky or Tioad or Ar computed for some reference r. More over neither nor 
£q.dis functions with a minimum and they vary over the full range of positive values. In addition 
within a pointing period A^ repeated sky samples are acquired. In making maps repeated samples 
are averaged and will be reduced by a factor 1 / Va^ [Maris et al. (2004)]. So a relatively high eg 
could be acceptable at the level of single samples when observing stationary sources. However the 
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Figure 7. Iso contour lines for the functions entering e^^diff and e^ jiff as a function of (ri, r2). Vertical blue 
dashed lines cri , horizontal blue dashed lines era, black dashed lines (^2 - r\)-^, blue contours (ri - r)^ + (r2 - 
r)^, red dashed contours y 0-10-2, and the red contours are the resulting e^y difF- The width of the lines varies 
with the value of the function. 

ratio between eg and the noise will not change after averaging. So a convenient choice would be to 
consider eq/cr, where cr could be the RMS of T^ky, Tioad or Ar depending on the case. The only 
hard constraint which has to be considered is that needed to avoid saturation. 
The general formula for the target function is 

A^(0) = ]~[a(0)"', (2.2) 

c 

is a vector in the parameter space, Qc is a function varying over the range [0, 1] with Qd®) = 1 
if fits the particular criterion c for which the function is defined, Qc(@) = if does not fit this 
criterion. Intermediate values may be also defined in the [0, 1] range measuring the goodness of fits. 
As an example, a criterion for optimal q is to have ydiff = niin(7diff), the corresponding criterion 
function is Qc(&) = min(7di(f)/7diff. The exponents lie ^ 0, with Yjc^c = 1 are weights defining 
the relative importance of each criterion within a given policy. In general is is better to have Qc(&) 
which are derivable. In some case it is necessary to to deal with poles that have to be avoided. A 
solution is to define a metric jUf(0) > which could have a single pole for which yUc(0) — > +°° and 
take Qc(&) = e~^'''■'^^ or Qd®) = 1/(1 +£'"'''<®^). Typical criteria are shown in Tab. 1 

2,2 Analytical Optimization 

Analytical optimization (AO) is based on analytical formulas assuming either normally distributed 
or uniformly distributed signals. As a starting point for more refined numerical optimization. At 
the root of this method of optimization is the requirement of minimizing the processing errors, 



Table 1. List of possible criteria 
Condition Criterium 





= min(6g,diff ) 


Qc 


= min(eg,diff)/eg,diff 




= min(6^,sky) 


Qc 


= min(eg,sky)/e<?,sky 


^^,load 


= min(6^,ioad) 


Qc 


= min(egjoad)/eg,ioad 


^^,load 


= ^q,diff 


Qc 


= 1 - k?,load - f^.diffl/ max(egjoad + fg.sky) 




— ^^,sky 


Qc 


= 1 - k^,load - e^.skyl/ niax(e^joad + eq,sky) 


f^,diiF 




Qc 


= 1 - kij.diff ~ fg.skyi/ niax(e^4iff + e^,sky) 



a) b) 




Figure 8. Comparison for e^^^diff = Q/,sky top-left, e,,diif = Q/.ioad top- right, e^^.sky = e,^_ioad bottom-left and 
the corresponding best fit regions bottom-right; as a function of (ri, rj). The lines are iso-contours of 
exp(-|.\:|) where x is the difference between the two arguments to be compared. For the bottom-right frame 
the iso-contours are tracked for cubic root of the product of the comparison functions. The simulation is for 
7" sky = T'load = 0, (Tsky = cTjoad = 10 ADU, r = 1. No drifts are included. 

as an example the e^diff- However given they diverge at ri = r2 it is necessary to consider the 
maximization of their inverse normalized to the minimal value, as an example defining for diff 
the function Fdiff = min(e^diff)/fi?,diff- These functions become for fx = ^2, unless either ri = r or 
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a) 



b 



m =0,m =0,(! =10,o = 10,r = -lnl 




-3-2-10123 



C) 



d 



m =0, m =0,0 = 10, a = 10, r = -lnt niin(y )/y 
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Figure 9. Iso contour lines identifiing the regions in the (ri, r2) space where min(e£^_(jiff)/e^_diff top-left, 
min(e^,sky)/e?,sky top-right, min(efy,ioad)/e£/,ioad bottom-left, and where all of the are as near as possible 
at their minima, bottom-right. The lines are isocontours of min(x)/x which is 1 when the minimum is 
reached. For the bottom-right frame the iso-contours are the cubic root of the product of the other frames. 
In the top-left frame white + denotes the positions of the analytical solution, the black thick contour is 
for min(e^.diff)/e^,diif = 0.5, the black thin contour lines are for min(efy_diff)/e(y,difF assuming H = H^a- The 
simulation is for Tsky = T^ioad = 0, (Xsky = crioad = 10 ADU, r = 1. No drifts are included. 



r2 = r 

Again it is convenient to use the normalized parameters r,, in this case the covariance matrix 
of processing errors is 



12(r2-ri)2 



i;2 , f.2 r,+r2 \ 



(2.3) 



In the framework of the low-level approximation for ^opt calculation, after replacing Eq. (1.53) into 
the e^^difr from Eq. (1.57), substituting r, — > r,-, q and r — > r = r/i?o-, and taking its reciprocal 
one obtain 



-24- 



Ta. = f'-''^ = (2.4) 

[(ri - r)2 + (h - r)2] -^(1 + ^2 - 2ri£,,i)(l + rj - 2r2fti) 

Tdiff is symmetrical with respect to the axis ri = f2 and has a maximum where the processing error 
has a minimum. There is no analytical way to maximize Fdiff . However, Fig. (7) shows the contour 
plot for the various components of this function. The denominator is the product of a function 
which is constant over circles centered on ri = r2 = r (or n = r2 = r) and which increases going far 
from that point, and ^ld-\&2 which has a more or less elliptical form and that for = is centered 
on f\=f2= 0. The numerator is null for f\ = f2 line of constant numerator are parallel to f\ = f2 and 
increases going far from that line. Hence the Fdiff maxima must be symmetrically aligned along a 
line normal to fi =f2. The line has to cross the f\ = f2 line at f\ =f2 = r^, with < rc < r, so that 
the maxima for Fdiff are located at 



ri ^ Tc ± ^Cin, (2.5) 
V2 

f2-rc + —=lmn\ (2.6) 

V2 

where fmin measures their distance from the fi = f2 line. Numerically it is possible to show that in 
the case q^x = sufficient numerical approximations to ^min and as a functions of r are. 



0.6994r + 0.2722 



(2.7) 



0, 0<r<0.701 
-3.0836r2+6.7034r + 3.0969, 0.701 <r< 1 
-0.3985r2 +0.3367r- 0.4369, 1 < r < 10 



(2.8) 



Fig. 9a represents a typical pattern for Fdiff(ri,r2) (the r, are converted into r,). It is assumed 
r = l/7?o-. The optimization produced by maximizing Eq. (2.4) could be improved by using the 
approximation for the entropy in Sect. A which takes into account of possible overlaps between 
the Q\ and Q2 distributions, allowing a better approximation to ^opt(''i,''2). So in the figure Fdiff 
have been computed by using the method in Appendix A but black contour lines are those obtained 
assuming H = Hca at the root of Eq. (2.4) it is evident that the two approximations agree quite 
well. Crosses mark the position of maxima calculated with the approximated solution described 
above. The Qack factor for this case does not reveal any saturation. So it is possbile to look for 
other combinations of optimized parameters. As an example Fig. 9 b) and c) are the equivalent of 
Fdiff computed for e^ ^ky, and e^joad. Of course while Fsky has well defined maxima this is not true 
for Fioad given e^yjoad has not an upper limit. Fig. 9 d) represents the product of Fdiff and Fsky. We 
may look at combinations of parameters where, e^,sky = fg.ioad or e^ ^ky = eq,m or e^joad = f<?,sky as 
in Fig. 8a,b and c or e^ ^iff ~ e^joad ~ £q,Ay and thus use the product of the second group of criteria 
in Tab. 1, assuming all the H^ = 1 /4. as shown in Fig. 8d. 
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2.3 Dealing with saturation 

A more complex situation could arise if the selected optimal ri, and ^opt lead to saturation. In 
this case either a (ri,r2,^opt) far from the Fdifr peak has to be selected or ^opt has to be increased in 
order to reduce the corresponding GL^a factor. 

In the first case the requirement Cr = C^^^' will be assessed but the quantization error will be 
larger than the optimal one. To limit this error the new (ri,r2,^opt) would have to be selected as 
much as possible along the ridge near the Fdiff peak ad as much as possible far from the r\ = line. 

In the second case we consider the fact that Qack 1 so it is possible to take r\ and ^2 at the 
FdifF peak but to take 

o /--(max / peak peaks 
-JopLQack = •SftQ^^l^l^optCq ) 

where = niax(|Q^^^''j,|Q^^^'^|) and Six_ > 1 is a safety factor, which typically is Sfi = 2. Of 

course in this case the data are compressed at an higher rate than cj^^ while the processing error 
will be increased by a factor «SftQ™j . 

2.4 OCA2K, non idealities and numerical optimization 

Non-idealities in the signal and in the compressor cause the effective to be different from the 
expected Cj^, and in general Cj^ > Q. A formal way to account for this is to define a compression 
efficiency rjc^ < 1 defined as: 



^c. = §k' (2.9) 



which is the product of the contributions of each non-ideality. In general it is very difliicult to 
account in a satisfactory way for even the most important non idealities as is illustrated by the 
following examples. 

A group of non-idealities comes from the fact that each time a new symbol is discovered in 
the data stream the compressor adds at the compressed output a "stop" pseudo-symbol followed 
by the uncompressed symbol. Then the compressor is coding the symbols in the input data stream 
plus the "stop" pseudo-symbol and consequently the entropy for the compressed data stream to be 
introduced into Eq. (1.17) is changed by a factor t]~^^^ 

_1 <^stop , tfislop l+log2(l+^stop) ^, 

'7stop = — 1082 7- + — 7777- ^ — ' (2-10) 

^ l+^Cstop l+V'stop H(l+ipstop) 

where ipstop = A^symb /"samples, -^symb IS the number of difi"erent symbols in the packet, ^samples the 
number of samples stored in the packet and H comes from Eq. (1.17). Note that for ^stop — > 0, 
'7stop — > 1 ■ In general for small ^stop the addition of stopping symbols increases the entropy leading 
to ?/stop < 1, but when i/Jstop is sufficiently large the compressed data chunk is diluted in a large 
number of repeated symbols, reducing the entropy of the signals and giving ?7stop > 1- However the 
potential gain in Cj^ is compensated by the need to add uncompressed symbols. If A^bits.code is the 
number of bits needed to store the information used to decode a symbol, Lpck is the length of the 
packet, then from the condition 
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"t" ^symb^bits,code — ^ck) 



'Th 



r 



and from Lin = isampies^^bits, assuming the optimal case Lout = ^-pck the dumping factor for the 
compression efficiency is derived 



In general, for a stationary signal A^symb "sampi 

es SO that 7/store is a sccond order correction which 
will be neglected in the remaining of the text, but it becomes an important factor for the case of 
non-stationary signals for which A^symb ~ "samples, which could occur in case of fast drifts. 

Two non-idealities very complicated to be analyzed are the difference between the expected 
entropy and the sampling entropy, and the compressor inertia. 

The theoretical estimates of entropy and hence of the expected compression rate, gives the 
expected entropy calculated on an ideally infinite number of realizations of samples. This means 
that even very infrequent symbols for the samples are considered by theory. But the compressor 
stores a few hundreds of samples for each packet leading to a truncated distribution of samples and 
consequently to a sampled entropy which in general is smaller than both the theoretical expectation 
and the entropy measured on a long data stream. In theory if G(Q) is the cumulative PDF for 
the distribution of samples, and if Q is bounded between Qi„f and Qsup it would be sufficient to 
rescale the fg by l/(G(Qsup) -G(Qinf)) and redefine accordingly the sum in the definition of the 
Shannon entropy. As an example, in the case of a simple normal distribution cutting the distribution 
respectively at 1, 2, 3 and 4cr will reduce the entropy as predicted from Eq. (1.42) respectively by a 
factor ?7c, = 0.79, 0.89, 0.95 and 0.98. However, the difference between theoretical entropy, or even 
the entropy measured on long data streams, and the sampling entropy measured on short packets 
could be changed by the presence of correlations in the signals on scales longer than the typical 
time scale of a packet. Last but not least, it is necessary to consider that the compressor takes some 
time to optimize its coding scheme, leading to a further loss in compression efficiency. 

The effect of all of these non idealities are too complicated to be introduced in the theoretical 
model, so that the tuning of REBA parameters based on the theoretical models has to be refined 
by numerical optimization. Numerical optimisation it is important because the handling of difficult 
cases in which the hypothesis of the theoretical model completely fails, it allows experimentation 
with artificial perturbations introduced in the signal and it includes higher order effects such as the 
packet-by-packet variability of Cr, In addition numerical simulations must be used to verify the 
optimized parameters before uploading them to the instrument. 

With these aims the Onboard Computing Analysis (OCA) software was developed, composed 
of a scanner, able to run the same test on diff'erent combinations of REBA parameters; an analyzer, 
able to automatically extract relevant statistics on each test; an optimizer, able to apply different 
policies defining when a combination of parameters is optimal or not selecting the best combi- 
nations; a report generator, used to generate automated reports. Apart from REBA optimization 
the development of the OCA libraries has been driven by the need to have a flexible environment 
for testing ground segment operations as explained in [Fraillis et al. (2008b)]. Hence, OCA is able 
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to read, decode, and process small amounts of raw data coming from the Planck/LFI scientific 
pipeline from packets to complete timelines. 

At the core of the part of the OCA software dedicated to the REBA optimization there is a C++ 
kernel, (0CA2K) which processes the input data for each combination of REBA parameters perform- 
ing: i. on-board mixing and quantization by using the real algorithm; ii. on-board compression by 
using the on-board algorithm; iii. on ground decompression and reconstruction. 

It has to be noted that 0CA2K uses the same C code for compression operated on-board. So it 
does not emulate the compressor but uses the real compressor. In addition the validation of proper 
emulation of the on-board and on-ground processing has been provided by using data generated 
in the framework of the validation of Level-1. of the Planck/LFI DPC [Fraillis et al. (2008b)]. In 
that way we demonstrated that 0CA2K processes the data in the same manner as the real processing 
chain. 

The input of OCA and 0CA2K are short data streams of raw data downloaded from the instrument 
just before on-board averaging or just after it, see Fig. 2, depending on whether A^aver has to be 
optimized or not. In output 0CA2K provides measures on a packet-by-packet base of and its 
related quantities such as the estimated packet entropy, or the measured compression efficiency 
?7oca2k- It provides also sample-by-sample estimates of critical parameters as e^, and Qack- 

Despite 0CA2K is written in C++, it remains a heavy, offline tool, which can not be directly used 
for a crude exhaustive real-time optimization. This is the reason for which analytical methods have 
been developed. On the contrary OCA has the ability to use the analytical models to focus on the 
relevant region of parameters space. 

OCA allows the determination of the optimum parameters according to different optimization 
strategies and constraints. This is important given the different ways in which REBA parameters 
are optimized during ground tests and in flight. During ground testing the usual procedure has 
been to stabilize the instrument and its environment, calibrate the DAE and then to acquire chunks 
of about 15 minutes of averaged data to be analyzed by OCA to optimize the REBA parameters 
[Cuttaia at al.(2009)]. After setting the REBA parameters another 15 minutes of acquisition, this 
time with the nominal processing described in Fig. 2 is executed as a cross-check. 

OCA could be used as a stand-alone application, but different interfaces for OCA to other pack- 
ages have been created for different applications. For ground segment testing OCA provided an 
IDL and C++ library used in a stand-alone program. The same occurred for the Planck/LFI simu- 
lation pipeline where parts of the 0CA2K simulating on-board preprocessing and ground process- 
ing, (excluding compression and decompression) have been included in the Planck/LFI simulation 
pipeline. For the REBA optimization during the ground tests, OCA has been used within the LIFE 
framework [Tomasi at al.(2009)]. For routine operations in flight OCA has been included in the 
PEGASO [Tomasi at al.(2009)] software tool designed to monitor the instrument health and perfor- 
mances at the Planck/LFI DPC. 

2.5 The 0CA2 optimization algorithm 

As a premise to REBA processing optimization, a value for A^aver, a cj^' and a function of merit 
X appropriate to the case under analysis have to be fixed. As explained, in general A^aver is already 
fixed by other considerations than REBA processing optimization. A slightly higher than needed 
^' is taken in order to allow some margin. While the Fjiff is considered a sufficient function of 
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merit, but more complex functions, such as those in the family of functions presented in Eq. (2.2) 
are used as well. 

A data chunk long enough to allow the generation of about a hundred compressed packets, 
is acquired for each radiometer. In general, the data chunk is on-board processed by allowing 
coadding for the the given A^aver- For that chunk relevant statistics such as Tsky, Tioad. CTsky. o"ioad> 
o"sky,ioad, AT^sky, Arioad are measured and from that r and Ra- are evaluated. 

The analytical optimization is performed in order to determine in an approximate way the 
region of ri, r2 where the function of merit could have a peak; to grid the region ri, r2 (typically 
by regular sampling); to determine for each point in the region the function of merit x{ri,r2) and 
(r°^'™, Tj^'™) as well as the {r\ , r2) for which ;^'(rl , r2) has its absolute maximum; and finally for the 
previously determined (r°^'™, r°^'™) the (9°?'™ = C)(r°'"™, r"'"™) and the 9*,;^^^ = ^opt('"i'"™> r"'"™) 
are determined. After that max(|<3^^|^(rj'"™,r2'"™)|) i.e. the maximum value of |Qack,i(OI among the 
fiack values determined on the data chunck for g = 1 (see Eq. (1.60)) is measured. From max(|Q^^j^|) 
^optim could be corrected for saturation. In fact, if max(|Q^^j^|) < (1 ->5ft)^*,i^ the analytical 
optimization returns (j^^j as the best estimate of q otherwise it forces g^jj^jj = max(|Q^^]^|)/(l -Sft). 
In the latter case ij* ,. is said to be saturation-limited and of course in that case it is expected to 

^ optim ^ 



have Cr > cj^\ 



After the analytical optimization the jj^^^ has to be numerically refined in order to take into 
account the non-idealities of the compressor. If ^^jj^^j is not saturation-limited the 0CA2K is oper- 
ated to determine, by a polynomial search, the best ^opt allowing Cr = cj^^ for given r"'"™, Tj*"™, 
and In general the search is performed for q in the range max(|Q,'^^j,(rj'"™,rj''"'")|)/(l -Sft) 

and 2o* . If is saturation-limited the numerical procedure could be in principle skipped. 

-'optim -'optim ^ f f ff 

However non-idealities could cause Cr < cj^' even in this case and to check for this a single run 
of the numerical optimization is performed for the selected parameters. If Cr > cj^' the procedure 
is concluded, otherwise the polynomial search is performed. 

When ^opt have been numerically refined a last run of the numerical code for r"'"™, rj'"™, 
^optim ^jj^ is performed to asses the processing error and the histogram of the compression 
rate. 

The typical time to perform the optimization sampling (ri , r2) with a grid of 25 x 25 samples 
and a TOI of about 15 minutes of data, is about 20 sec, so that the optimization of the whole set of 
44 detectors takes less than 15 minutes including the overheads for data lO. 



3. Results 

Here are discussed the results of REBA calibration and optimization in the framework of the 
Planck/LFI ground tests. We look at first at a single test to compare the analytical and the numeri- 
cal optimizations. We then look at the results of the calibration for the whole set of 44 detectors in 
a real case. 

The results of analytical v.z. numerical optimisation are compared by using real Planck/LFI 
data acquired during the RAA tests of the instrument performed at Thales Alenia Space (Italy), 
during the summer of 2006. Fig. 10a shows 12 min of data with A^aver = 52, equivalent to about 
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Figure 10. Example of signal for the detector 2300 acquired during the tests and its optimization. Frame a: 
are the sky and reference-load and AT samples in analog-to-digital units (ADU). The differentiated signal 
has been shifted up, having null mean. Relevant statistics are reported in Tab. 2a. Frame b: represents the 
accuracy by which the analytical model is able to predict the entropy measured on the quantized signal as a 
function of r\ and for ^ = 1. Contours of regions are for accuracies of 0.5%, 1%, 1.5%, 2%, 2.5%. White 
boxes denote the regions where an accuracy worst than 3% is obtained by the simplest analytical model. 
Outside these regions the two models are completelly equivalent. Frame c: represents the accuracy by which 
the most sophisticated analytical model presented in Appendix A is able to predict the Cr for g = 1 . Contour 
lines are for accuracies of 5%, 10%, 15%, 20%, 25%, and 30%. Frame d: compares Fjiff for the analytical 
model with Fdiff computed for the numerical model. Colours are for Fdiff analytical equal to 0.1, 0.2, 0.3, 
0.4, 0.7, and 0.9 while contours are for the same values of Fdiff numerical. The black hexagon identifies 
the regions for the peaks of Fjiff analytical, the crosses for the peaks of Fjiff numerical. Frame e: the same 
as of Frame d, but for Fsky. Frame f: in black on the left, the histogram of Ci- obtained on the 111 packets 
produced by the compression procedure with the optimized parameters of Tab. 2b. The figure gives also the 
5% and 95% percentiles (p05 and p95) and the mean, and in red a gaussian fit of the histogram. In addition 
the figure shows in red at the right the Ci expected from the entropy directly measured on each packet, and 
the theoretical compression rate expected from the theroretical model (CrTh) and from the entropy measured 
on the whole quantized TOI. 



56715 samples, while the Tab. 2a gives the relevant statistics of the TOI. During the test the instru- 
ment and its environment was stable, no strong drifts are present in the data. A clear correlation 
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between sky and reference-load is evident in the plot explaining the f^si ~ 1 ■ and the factor of six 
reduction of the RMS when passing from undifferentiated to differentiated data. Also the separa- 
tion between sky and reference-load is not large, being just 18cr. So after mixing the distributions 
for V\ and V2 will stay well separated, with |Adistrl > 100, when |r2 -r\\> 0.2. In this case it is rea- 
sonable to expect that both the low-accuracy and high-accuracy methods to estimate analytically 
the entropy will give comparable results. 

Indeed, we used both models to optimize the REBA parameters taking cj^' = 2.4, and both 
models produced exactly the same results reported in the second column of Tab. 2b and Tab. 2c. 

To test the goodness of the AO, 0CA2K was run imposing q=\ and taking the same values of 
rl, r2 used for the AO. The predicted entropy of the processed TOI is compared in Fig. 10b. There 
the relative difference between the entropy measured all over the TOI and the entropy computed 
analytically by using both methods is reported. Patches define intervals of accuracy in steps of 
0.5% up to 3%. Both methods to estimate the entropy are good predictors of the measured entropy, 
apart from the region marked with the white boxes where the low accuracy method overestimated 
the entropy. 

In a similar manner we compared in Fig. 10c the measured Q and the predicted from the 
high accuracy model. Again the model is able to reproduce within 20% or better the measured Q. 
As discussed before the differences can be ascribed mainly to the difference between the sampling 
entropy and the expected entropy ([Maris et al. (2000)]) and the not-ideal behavior of the compres- 
sor ([Maris et al. (2000)]). In general the effect of the sampling entropy would result in a higher 
Cr than expected while non idealities in a lower C^. Different ways can be used to calibrate these 
effects, however their interplay with the statistic of the signal is complicated and it is preferable to 
use 0CA2K to fine tune the REBA parameters optimized by analytical mean, given that in general 
the corrections required to properly tune with respect to the analytical prediction q are at most of 
about a factor of two. 

The numerically refined optimal parameters are reported in the third column of Tab. 2c, as 
is evident the only variation is just for q, the reason is explained by the contour plots in Fig. lOd 
and Fig. lOe which compares the predicted analytical Fjiff and Fsky with the Fdiff and Fsky obtained 
for the numerically refined parameters. Very good agreement is obtained in the location of the 
peaks which determines the optimal r\, ri and in turn the O. For completeness the last column of 
Tab. 2d reports the theoretically estimated (Sack, £q,m, fg.sky and e^joad after replacing the analytical 
optimal q with the numerical one. The very good agreement between the theory and the experiment 
is evident. 

As expected the quantization error for the differentiated data is smaller by about a factor of four 
than the error for sky or reference-load and anyway the error will be a fraction of ADU, but larger 
or comparable to the quantization error introduced by the ADC converter, which for A^aver - 52 is 
equivalent to ~ 0.04 ADU. 

Eq. (1.24) expresses the processing error for an univariate normal distribution as a function of 
the crlq ratio, but from Sect. 1.5 it is evident that in the present case the criq ratio is not a good 
measure of the processing error. At the opposite, it is possible to define an effective ajq ratio in 
terms of the processing error as 
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which expresses the number of independent quantization levels which could be accommodated 
within Icr. So this ratio gives an idea of how well the histogram of the differentiated data is sampled 
assuming it could be represented by an univariate normal distribution ^. A proper sampling would 
assure at least (cr/g)eff > 2 which is the case for this work as it is evident from Tab. 2c having 
{cr/q)e« > 6. 

Before concluding this comparison it is worth commenting the way in which the experimental 
Cr is reported in Tab. 2c. This is best done by looking at Fig. lOf where the histogram for the of 
the 111 packets produced in the test is shown. Given the true Q is a random variable, varying from 
packet to packet we take the 5% and 95% percentiles assessing that in less than 5% the Cr will be 
respectively smaller or larger than the quoted as well as the mean and the median (not quoted in 
the figure) of the measured Q. In this case it is evident that the target C^tlA in something less than 
half of the packets. So it would be better to introduce some safety factor, requiring for example 
cj^' ~ 2.6 or to require that the median be 2.4 or better with the 5% percentile to be ^ 2.4. 

Fig. 1 1 is representative of the results of the calibration for a whole set of 44 detectors per- 
formed during the Planck/LFI CSL test campaign in 2008 [Cuttaia at al.(2009)]. Data have been 
collected over two acquisitions, the first one being used for the calibration itself and the second one 
to verify the calibration performances. The environmental set-up and the onboard electroiucs were 
kept in a stable state during both acquisitions. During the first acquisition, called "calibration run", 
the on-board computer was configured to apply just the downsampling step to the data but skip- 
ping mixing, requantization and compression. The acquired data have been ingested into 0CA2K 
to generate a list of optimized processing parameters for a target = 2.4. Having produced a set 
of parameters for the REBA, the second acquisition, the "verification run", has been run while the 
instrument was set up to acquire data in nominal conditions, by using the same steps that are going 
to be used during flight i.e. mixing, requantization and compression followed by ground process- 
ing: decompression, demixing and dequantization. At the same time data have been also acquired 
in the raw format used in the "calibration run". So for each detector couples of data streams with 
and without on-board processing were obtained which have been compared in order to measure 
the processing error, following a procedure similar to the one described in [Fraillis et al. (2008b)]. 

Fig. 1 1 compares the mean Q, £q,sky/cr^ky, e^^.ioad/cioad, e^.difr/o-diff , where (Xdiff is the r.m.s. for 
the differentiated data. Bars in light colours are for results obtained by processing the data taken 
in the calibration run with the 0CA2 simulator. Bars in dark colours are the results from the data 
processed by the instrument in the verification run. In both cases the same set of optimal REBA 
parameters have been used. There is good agreement between the two runs, despite the presence 
of a few systematics. Such differences are due to slight changes in the environmental conditions 
between the two runs Differences in channels belonging to the same radiometer (00 and 01, 10 
and 1 1) are due to the fact that in the warm back end the two channels go through separate acquisi- 
tion lines, each of them being characterized by different noise properties [Bersanelli et al. (2009)]. 

^The (o"/(?)eff could be used to characterize the processing in the case the value of £q^sky and £q^\oad is not relevant. 
^In the CSL tests the satellite has been kept within a large cryogenic vacuum chamber which was not as stable as the 
L2 environment is. 



Such differences are usually small, e.g. detectors (00, 01) and (10, 11) of Feed-Horn #19. In a 
few cases however larger differences occurs, e.g. detectors (10, 11) of Feed-Horn #25. Again, 
the relative processing errors for sky and reference-load are very similar, and in 95% of the cases 
they are below 0.4 with some extreme deviations such as detectors 10 of Feed Horn #25 and 1 1 of 
Feed Horn #26 for which eg,sky/o"sky ~ f<?,ioad/o"ioadil- Here the optimal ^opt is not peculiar with 
respect to the values required for the other detectors, but optimal ri and r2 are very similar hav- 
ing \r2 -ri\ = 0.04 which is the resolution of the search grid in the (ri,r2) space. Such relativelly 
"coarse" resolution in the search grid for optimal (ri,r2) was imposed by constraining the need to 
optimize the REBA parameters within a few minutes after the data acquisition. The coarseness of 
the (ri,r2) grid is also the reason for the apparent coincidence of the mean values of ri and r2 for 
different frequency channels in Tab. 3. In flight such time constraints will be removed allowing a 
multi step iteration of the optimization procedure and the use of a thinner grid. However, even in 
those extreme cases, the coorrelation between sky and reference-load processing errors leads to a 
much smaller error for the differentiated data. In such cases the final processing error is always less 
than 3.8% of the instrumental white noise.. Scaling those numbers to the calibrated sensitivity per 
sample and per detector the calibrated processing error, AT^, was derived and is reported in units of 
yuK per sample and per detector in line 10 of Tab. 3. On average Ar^ is below the 3 juK level taken 
as a threshold for systematics [Mandolesi et al. (2009)] apart from for the case of the detector 01 
of the Feed-Horn #24 for which ATq ^3.2 ^iK. 

The values for the optimal REBA parameters are mainly determined by the frequency of the 
radiometric channel with some dispersion from detector to detector. Tab. 3 gives representative 
median values for ri , r2, ^ = l/Sq from the CSL tests as well as for the quantities in Fig. 1 1 and the 
resulting data rate. O is omitted since it is the most variable parameter and it has no significative 
impact on €g and Q . Tab. 3 reports also the number of detectors for each frequency channel, the 
A^aver valucs which are kept constant, the compressed data rate per detector, per frequency channel 
and for the instrument as a whole. Quantities are reported in the form x±6x where 5x represents 
the standard deviation taken as a measure of the internal dispersion of x within the given subset of 
detectors. It has not to be interpreted as an error and it must not be propagated. 

The total data-rate in Tab. 3 is just 7% higher than the target data-rate 35.5 bits/sec. Again 
this departure is mainly due to the limited resolution in the search grid as well as small changes in 
the environmental conditions between the two runs. In order to cope with this problem it is likely 

"Jot 

that during operations a safer Q ° = 2.5 target will be set in place of the nominal 2.4. 

Finally it is worth to consider the gain in the accuracy of the REBA optimization obtained by 
the complex procedure described in Sect. 2 with respect to the fairly simple scheme used in the 
earlier RAA test campaign [Bersanelli et al. (2009)]. During the RAA tests a simplified algorithm 
had been applied based on the fact that putting ri = r the processing error for the differentiated data 
reduces to Eq. (1.59) which is independent of ri and r2. Hence, the only free parameters where q 
and r2 + r. The optimization was performed by imposing Q = 2.4 and selecting those parameters 
for which e^^sky ~ e^.ioad ~ £q,m- Even in this case the required Cr = 2.4 was achieved but eg^iff/crdiff 
was between [0.08,0.14], when compared to the current eq,mlcrm < 0.038 it is evident how this 
procedure represents a substantial improvement. In particular LFI has as a target of keeping all of 
the instrumental systematics and non gaussian noises in the differentiated data below 10% of the 
instrumental white noise [Bersanelli et al. (2009)]. The optimization scheme described here allows 
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the reduction of the processing error on the differentiated data by a factor of four pushing it below 
this ambitious target. 



4. The impact of the on-board processing noise on the Planck scientific performances 

5. Final Remarks and Conclusions 



Acknowledgments 

Planck is a project of the European Space Agency with instruments funded by ESA member states, 
and with special contributions from Denmark and NASA (USA). The Planck-LFl project is devel- 
oped by an International Consortium lead by Italy and involving Canada, Finland, Germany, Nor- 
way, Spain, Switzerland, UK, USA. The Italian contribution to Planck is supported by the Italian 
Space Agency (ASI). We thank the support of the Spanish Ministry of Science and Education. 



A. Approximation of the bivariate entropy 

This section presents the approximation of the entropy for an interlaced bivariate distribution for 
the two limiting cases of a uniform distribution or a normal distribution. 

The case of a uniform distribution 

For the case of two uniform distributions are taken the intervals where the distributions are not 
null as 2ij < Qi < Qi,r and 02,1 < Qi < Qi,^ Also are defined the widths A^i = Qij - Q\,\ and 
A^2 = G2,r - Q2,\, and the centers Qi = (gi,! + Qu)/2, Qi = {Q2,\ + G2,r)/2. Without any loss of 
generality A^i < N2 is assumed. The entropy is a Unear function of |A| = |(22 - Gi | bounded between 
the lower limit 



2\ N2j 2 \Ni N2I \Ni N2 
for the case the two intervals are completely overlapping, and the upper limit 

Hoc = log2 Ni + log2 N2 + I, (A.2) 
for the case of complete separation of the two distributions. So that 



H = 



( lAI ^ ^2-^1 TJ 

1^1 < — 2 — ' 

^ < |A| < (/f^-H^„)M±^+//^n . (A.3) 

^ < |A|, Hoc 



The case of a normal distribution 

Witout any loss of generality, two sources of normal-distributed interlaced signals having respec- 
tively variances equal to 1 and cr^ > 1, quantized with a quantization step ^ < 1 are considered. 

It is necessary to find an approximation for if(Adistr,9,cr). This could not be derived analyti- 
cally, but in a manner similar to the case of the uniform distribution the entropy is bounded between 
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a lower limit, /fmin. and an upper limit Hoo, while varying Adistr- In Fig- 12a shows how the entropy 
varies as function of Adistr and for three different values of cr. In addition both H, Hca and H^m 
are proportional to -logj^ so that their differences does not depend on q. For this reason it is 
convenient to define the Normalized Entropy, h 

1 ^ ~ /A ^ X 

1 = ; (A.4) 

M — M 
^min ^oo 

which is will be just a function of Ajistr and cr as shown in Fig. 12b as full lines for three values of 



cr. 



Having h, H - Hoo and q we can readily estimate H with 



H = Hoo-log2q + iHoc-Hjnm)h, (A.5) 

note that we do not need to estimate /fmin, and that Hc<, could be readily estimated from Eq. (A.6) 
by putting crj = 1, cr = o-j/cti, and expressing q in units of crj. 

The difference {Hoo - /^min) ^ 1 bit is always positive and just function of cr. It is null in 
the limit cr — > +oo as shown in Fig. 12c. The figure shows as dots an approximation obtained 
numerically for I < cr2 < 4000 for which 



Hoo ^^min — exp 



(A.6) 



2]A„(log(r2)"+Ao 

ji=i 

with Ao = 1.0893 x lO'^, Ai = -8.3819 x lO'^, A2 = -2.3699 x 10"^ A3 = 4.8141 x lO'^, A4 = 
-5.1620 X 10-^ As = 2.1425 x lO""^ within an accuracy of ±1.1%. 

Even for h a numerical approximation is possible within a ±1% accuracy 

_ distr 

h^e 2°-' . (A.7) 

Here cr* is just a function of cr, and it is bounded between 0.25558 < cr^^ < 0.30797 with an ^ 21% 
variation, as shown in Fig. 12d. Given it is interested to have an upper limit for /; a simple approx- 
imation would be to take cr* = 0.30797 overestimating the entropy of at most ( V/fco -/^min)0.16 < 
0.16. However cr* is a function of (Hoo - /^min) and numerically we obtained 



cr* = V6.8497 X 10-2 + 2.5965 x IQ-HHco - Hmin) + e (A.8) 

with |e|< 2x10-2. 

With this approximation the typical accuracy in estimating the H is below 0.01 0.03 bits and 
the optimal q for a given /ftgt is promptly derived from 

log2 ^opt = (H^in - Hoo)h + H00- Higt, (A.9) 

within a relative numerical accuracy of about 3%. 

In short, the algorithm of optimization becomes: i.) given ri, r2, q, cri, 0-2, T\, T2 ii.) compute 
cr, Adisti, iii.) compute cr*, h, Hoo, Hoo - /^min, iv.) compute ^opt- 
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B. ADC quantization 



Throughout this work it is assumed that the ADC quantization is not relevant for our scopes. How- 
ever, it is worth to briefly recall its impact, in particular looking at the conditions at which the ADC 
noise could be neglected. 

The resolution, or quantization step of the ADC, ^adc is given by (Vmax - Vmin)/2^^ Volts/ADU. 
After averaging by A'aver samples ^adc is reduced by a factor 1 / V^aver- The eff'ect of ADC res- 
olution is to add in quadrature a non-Gaussian noise to the signals of RMS 1 / VT2 before av- 
eraging and 1/ yilA^aver after averaging. In addition the ADC itself adds a random read-out 
noise of ctadc ADU which after averaging is reduced to ctadc/ VMwer When combined these 
two noises the readout noise whose RMS is cr^on = Vl/12 + crADc before averaging and cTron = 
Vl/12 + o-ADc/ VA^aver after averaging 

When a signal of RMS ctq is input to the DAE a gain, G, is applied and then the measured 
RMS is 



depending on the ratio a^oalGao. The measured RMS will be dominated by the ADC noise or 
by the signal RMS. Signals whose RMS is comparable to the read-out noise are defined as weak 
signals. 

Of course in the case of weak signals the read-out noise is no more negligible when, as 
an example, the ctq has to be measured in order to estimate the T^y^. The same is true when 
the variation of the RMS of the signal tacking in account of variations of G has to be estimated 
[Cuttaia at al.(2009)]. 

In addition, given the 1/12 factor in front of the variance induced by the ADC contribution, the 
read-out noise could be dominated by the ADC noise when ctadc > 0.3. 

As a practical example if ctadc ~ 0.5 and ctq ~ 1 the a^a ~ 0.57 and the bias in estimating ctq 
win be ^ 15%. 

C. DAE Tuning 

In an ideal scheme of operations, the various stages of a complex instrument such as Planck/LFI 
would have to be calibrated sequentially, so that the calibration of the REBA parameters would 
be the last step of the calibration procedure [Cuttaia at al.(2009)] and would have no eff'ect on 
the previous stages of calibration. Practical experience has shown that there is a case in which 
the tuning of the acquisition electronics has consequences on the subsequent tuning of the REBA 
parameters. Indeed, the hypothesis at the root of the whole compression scheme is that the noise 
variance of the input signal is large. This is in general true but this hypothesis could fail if the 
variance of the signal after ADC quantization, on-board coadding and mixing becomes too small. 
In that case the signal will be over compressed with Q > cj^' and the quantization error will be 
larger or equal to the signal variance. To avoid this case either the DAE gain G and A^aver have to 
be properly tuned, or a set of particular combinations of ri , r2 values has to be excluded. 

The problem is to ensure (t\ and 0-2 to be greater than a minimal crtgt typically assumed to be at 
least 2 adu in a suitable range of ri, r2 values. From Eq. (1.34) it is evident that the cr^ as a function 




(B.l) 
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of r, defines two identical concave parabolas with a minimum in ri = r2 = r^m = Cskyjoad/cfoa^j- 
where both ai and 0-2 takes the value 



'^min — f^sky 



^Jl^l, (C.l) 



where £)si is the correlation coefficient between sky and reference-load. Note that CTmin = just as 
in the case of a perfect correlation between sky and reference-load. So a sufficient condition to 
asses proper DAE calibration is 

CTtgt < cr s]jy 

Vl^' (C-2) 

which puts a constraint on the minimum G/ V^avei- which could be accepted. In particular assuming 
the quantization and the readout noise are small with respect to the sky and reference-load RMS, 
at first order 

(C.3) 



where cTsky q is the sky RMS with G = 1 and no averaging. 

It could happen that in some cases the condition (C.3) can not be full-filled for any reasonable 
value of G and A^aver- So a forbidden region in the ri, r2 space is defined by the need to have 
CTtgt < min(cri,cr2). This defines a "cross" centered into ri =r2 = r^i„ = cr s^y,ioad/<^ig^^, see Fig. 13, 
with "harms" parallel to the two axis of the ri , r2 space and having for each harm a width give by 

A'- = 2^^/|^) -il-gl) (C.4) 




DAE calibrators could monitor the evolution of Ar as G/ V^aver varies. In general the optimization 
of ri and r2 is performed by scanning a rectangular region in the {r\,r2) space of limited width. 
An informative parameter to avoid to harm a proper REBA calibration after DAE calibration is to 
check the fraction of area of the region of interest excluded by the DAE calibration /dae.bxcI- It is 
not possible of course to write a general formula for all the possible cases, but often is small 
so that the excluded region has a center near r\=r2= 0, while the optimization region is squared, 
centered on the origin with -riim <r\,r2< +nim in that case 

(4riitn-Ar)Ar 

/DAE.excl = 7-5 ■ (Co) 

Arf. 

lim 
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Table 2. Table for the optimization example illustrated in Fig. 10. Subtable A) reports sky and reference- 
load statistics together with their correlation, and the derived r and Ra-. Subtable B) reports the optimized 
REBA parameters, obtained respectivelly with the analytical approximation, and subsequently with a nu- 
merical scan of the parameter space. The processing statistics are reported in the Subtable C) where the 
processing errors and the Cr expected from the analytical model are compared with the ntmierical results. 
The third column of the subtable gives the theoretical expectations after replacing q obtained from theory 
with q obtained by numerical means. 



A) TOI Statistics 





sky 


reference-load 


Combined 


Mean [adu] 


12041.29 


12313.63 




RMS [adu] 


9.72 


10.06 




Slope [adu/sec] 


0.026 


0.027 










0.9988 


r 






0.9779 








0.9659 


RMS(Ar) [adu] 






1.45 




B) Optimized Parameters 






Analytical 


Numerical 




rl 


1.25 


1.25 




rl 


0.83 


0.83 







785.41 


784.39 




q 


0.203 


0.317 






C) Processing Statistics 










Analytical 








with 




Analytical 


Numerical 


numerical q 


Npck 


97 


111 


_ 


Adistr 


479.1 


478.9 


- 


CTi [adu] 


3.291 


3.292 


_ 


0-2 [adu] 


1.885 


1.884 


- 


e^.difF [adu] 


0.043 


0.067 


0.068 


eq.sky [adu] 


0.211 


0.330 


0.329 


eq,\oad [adu] 


0.199 


0.310 


0.311 


(o-/q)ef£ 


9.7 


6.2 


6.2 


maxCIQackl) 


0.388 


0.228 


0.248 


Hjot bits 


6.667 


6.023 


6.021 


Mean H bits 




5.489 




Mean ?7oca2k 




0.828 




MinCr 




2.286 




5% Cr 




2.333 




Median Cr 




2.413 




MeanCr 


2.4 


2.414 


2.657 


95% Cr 




39- 2.469 




MaxCr 




2.510 




RMSCr 




0.045 
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Figure 11. Results for a typical session of REBA parameters tuning during the CSL test campaign. From 
top to bottom the figure reports for each detector the mean Cr, Qy.sky/csky, e^.ioad/cioad, e£/,diff/o"difF where 
cTdiff is the rm.s. of the differentiated data. The red line in background of the top frame denotes the target 
Q*^' = 2.4. Values are represented by bars. Light-bars are the results from the calibration phase, where 
raw data from the instrument are processed by 0CA2. Dark-bars are results from the verification phase, 
where processing is performed on-board. The secctOWrame from Top gives an example for detector 00 
of Feed-Horn #19. Feed-horns are numbered according to the internal Planck/LFI convetion assigning at 
Planck/LFI the Feed-Horns numbers from #18 to #28. Detectors belonging to the same Feed-Horn are 
grouped together as shown in the third frame from top. 



Table 3. Representative REBA Parameters, the measured and relative processing errors from tlie CSL 
tests for Planck/LFI. Detectors are grouped by frequency channel, for each quantity x the table reports its 
group median and group standard deviation 6x as a measure of the group internal dispersion, 6x must be not 
considered as an error. 





30 GHz 


Frequency Channel 
44 GHz 


70 GHz 


Detectors 


8 


12 


24 


^aver 


126 


88 


53 


n 


1.042 + 0.032 


1.042 ±0.024 


1.042 ±0.012 




0.917 ±0.065 


0.917 ±0.025 


0.958 ±0.020 


q [adu] 


0.297 ±0.034 


0.198 ±0.044 


0.279 ±0.048 




2.400 ±0.024 


2.440 ±0.019 


2.380 ±0.023 




0.420 ±0.278 


0.269 ±0.184 


0.177 ±0.063 




0.432 ±0.267 


0.271 ±0.183 


0.178 ±0.063 




0.0341 ±0.0016 


0.0369 ±0.0010 


0.0351 ±0.0010 


AT, [mK] 


1.759 ±0.148 


2.412 ±0.356 


1.905 ±0.287 


Data Rate per Detector [bits/sec] 


454.9 ±4.1 


640.8 ±4.7 


1108.2 ±9.8 


Data Rate per Frequency Channel [bits/sec] 


3641.8 


7689.9 


26600.3 


Total Data Rate [bits/sec] 




37932 
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Figure 12. Frame a: Entropy variation for a couple of normally distributed signals as a function of Adisa 
for three values of cr. The reference entropy is //oo. Frame b: Normalized entropy for a couple of normally 
distributed signals as a function of Ajistr for three values of cr. Full lines: numerical integration. Frame c: 
Normalized entropy difference Hoo - //min as a function of cr for a couple of normally distributed signals. 
Full lines: numerical integration. Dots: approximated formula. Frame d: The cr, parameter as a function 
of cr for a couple of normally distributed signals. Full lines: numerical integration. Dots: approximated 
formula. 
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